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Recognition of antigens by single- 
domain antibody fragments: the 
superfluous luxury of paired domains 

Serge Muyldermans, Christian Cambillaii and Lode Wyns 



The antigen-binding site of antibodies from vertebrates is formed by combining 
the variable domains of a heavy chain (VH) and a light chain (VL). However, 
antibodies from camels and llamas are an important exception to this in that 
their sera contain, in addition, a.unique kind of antibody that is formed by 
heavy chains only. The antigen-binding site of these antibodies consists of one 
single domain, referred to as VHH. This article reviews the mutations and 
structural adaptations that have taken place to reshape a VH of a VH-VL pair 
into a single-domain VHH with retention of a sufficient variability. The VHH has 
a potent antigen-binding capacity and provides the advantage of interacting 
with novel epitopes that are inaccessible to conventional VH-VL pairs. 

Probably the fastest and most efficient protein- 
engineering task is performed daily by the immune 
system of vertebrates. The immune system produces 
large amounts of highly specific adapter molecules - 
known as immunoglobulins or antibodies - that are 
raised to virtually all possible foreign molecules, 
either small or large. 



The antigen-binding site of antibodies 

The efficient design of such 'emergency' molecules 
relies on the possibility of generating an enormous 
number of different antibody molecules as B-cell 
' receptors. The B cells carrying an antibody that 
recognizes the antigen through surface and charge . 
complementarity will proliferate and evolve into cells 
that produce soluble antibodies. Despite having the 
capacity to produce a nearly unlimited variation in 
antigen-binding surfaces, the antibody molecules 
have a composition and overall structure that is 
remarkably well conserved throughout the 
vertebrate phylum. 

Antibodies are formed by two identical heavy and 
two identical light polypeptide chains, folded in four 
and two domains, respectively 1 (Fig. la). The 
N- terminal domain of each chain is more variable in 
sequence than the others. Sequence variations in the 
variable domain of the heavy and light chain (VH and 
VL, respectively) can be introduced at any of the 
multiple stages of antibody formation 2 (Box 1). ■ 
Sequence comparison of the various VH or VL 
domains indicated that there are six regions [three in 
VH (Fig. 2) and three in VL] in which the amino acid 
sequence is more variable than the remainder of the 
sequence. It was immediately hypothesized that 
these hypervariable regions would interact with the 
antigen, and they were therefore named CDRs, for 
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complementarity determining regions. (Within each 
domain, the CDRs are numbered 1-3 following their 
occurrence in sequence.) Indeed, crystallographic 
data confirmed that the folded VH and VL domains 
associate so that the six hypervariable loops 
juxtapose atone end of the molecule to form a 
continuous surface of -1000 A J . Antibody-antigen 
complexes showed that 190-350 A 2 or 400-900 A 2 of 
this surface actually interacts with haptens or 
proteinaceous antigens, respectively 3 . 

All CDRs can potentially make contact with the 
antigen, although very seldom all at the same time for 
anti-hapten or anti-peptide antibodies. The antigen 
contacts made by CDR3 are generally more extensive. 
These C-terminally located hypervariable regions are 
the most variable in sequence and length* because 
they contain amino acids encoded by codons that are 
generated by V(D) J recombination, including its 
imprecise joinings [see Box 1 for V(D)J 
recombination]. An 'antigen-eye view' of the antigen- 
binding site reveals that the highest variability is at 
the centre of the site, where the CDR3 loops of VH and 
VL cluster. Most likely, these loops will dictate the 
antigen specificity. The somatic mutations introduced 
at a later stage, during the affinity maturation 
towards a specific antigen, seem to occur preferentially 
at the periphery of the antigen-binding site 5 . The size 
of the third hypervariable loop of VH, in conjunction 
with the flexible association of VH and VL at various 
angles and distances, generates a structural diversity 
ofbinding sites that can be grouped into three major 
classes. These are schematically described as cavities, 
grooves and planar sites, and they correspond to the 
size and type (hapten, peptide and protein, 
respectively) of the antigen 6 . 



Canonical loop si 

At a structural level, the CDRs fold into one of a 
limited number of so-called canonical structures 7 . On 
the basis of the sequence similarity of key sites in the 
CDR, it was proposed that the same canonical 
structures would also shape the antigen-binding site 
of the cartilaginous fish antibodies 8 . These fish are 
. the most distantly related species to human (of those 
species that are known to have an immune system), 
which indicates that the canonical loop structures 
arose early in the evolution of the immune system and 



http://tibs.trends.corr. O968-0O04/O1/S -see from matter © 3001 Elsevier Science Ud. All rights i 



PII:S096B.0004(Ot]01790-X 



s Vo!.26No,4 April 2001 




Fig. 1. Schematic representation of (a) a classical antibody, (b) a heavy-chain antibody of camel ids, 
and (c and d) their respective antigen-binding fragments. The antigen-binding site of the VH-VL pair 
or of the VHH is denoted by the red boxes. The yellow bars in b and d indicate a frequently occurring 
inter-loop disulphide bond within the VHH. Abbreviations: Ab, antibody; CH1-3, first, second and third 
constant heavy-chain domains; CL, constant light-chain domain; Fv, variable fragment; VH, variable 
domain of heavy chain; VHH, variable domain of heavy chain of heavy-chain antibody; VI, variable 
domainofSight chain. 

were conserved thereafter. The canonical loop 
structures were defined originally by the Ca 
positions, but later analysis also considered the 
peptide backbone and this led to the subdivision 
of some canonical structures into subtypes 9 . 
The comparison between the sequence and 
crystallographic structures resulted in algorithms 



that allow the prediction of the conformation of five 
. out of the six hypervariable loops 1011 ; the CDR3 of 
VH, the most variable CDR in length and amino acid 
composition, is more difficult to predict. New 
crystallographic data has provided novel insights into 
this loop structure 12 - 18 , but it remains inadvisable to 
make sweeping generalizations regarding CDR3 loop 
conformations given that there are ~10 n different 
antibody specificities of which only very few 
. complexes have been investigated so far 3 . 

Heavy-chain antibodies 

In the natural world, the generation of functional 
antibody genes and the final composition of 
antibodies are seemingly so adequate that it is 
difficult to imagine improvements to, or even 
deviations from, the natural state. But surprisingly, 
the Tylopoda (camels, dromedaries and llamas) have 
developed an additional antibody molecule with a 
homodimeric heavy-chain composition that is devoid 
of light chains 14 ; such immunoglobulins are called 
'heavy chain antibodies* (Fig. lb). Consequently, the 
antigen-binding fragment of these heavy-chain 
antibodies is confined to one single domain (i.e. the 
variable domain referred to as VHH for variable 
domain of the heavy chain of a heavy-chain antibody) 
(Fig. Id), instead of the paired VH and VL domains 
(Fig, lc). The immunization of camelids showed that 
the response in conventional or heavy-chain IgG 
depended on the type of antigen 13 . 

VH and VHH sequence difference 

Obviously, the amino acid sequences of the variable 
domain of the naturally occurring heavy-chain 
antibodies would be expected to acquire important 
adaptations to compensate for the absence of 
association with the light-chain variable domain. 
Nevertheless, the VH and VHH amino acid sequences 



Box 1. Major mechanisms to build up the diversity of antigen-binding sites 



A quasi-infinite number of different antibodies can be 
generated, each having a unique antigen-binding site 
formed by the N-terminal domain of the heavy and 
light chains 3 . Several distinct mechanisms are at the 
origin of this antigen-binding site diversity. First, a 
recombinatorial diversity is obtained by random 
selection of one variable heavy-chain gene ( VH), one 
diversity gene (D) and one heavy joining (JH) 
minigene, or one.variable light-chain gene (VL) and 
one light joining (JL) gene segment out of a pool, to 
constitute the VH and VL domains, respectively! 
Second, a junctional diversity is added by the 
imprecise joining mechanisms and by deletion or 
addition of random nucleotides at the borders of the . 
recombining VH-D-JH minigenes. Third, a 
combinatorial diversity created by the assembly of the 
VH and the VL domain completes the antigen-binding 
site. Fourth, the architecture of the antigen-binding 



site isenlarged by adjusting the angle between the 
associated VL and VH domains 3 . 

The primary antigen-binding site then benefits 
from a specific maturation event by the acquisition of 
somatic hypermutations that improve the shape 
complementarity of the antibody with the target 
antigen. The underlying mechanism of the somatic 
hypermutation is still unclear; however, it is well 
established that it is dictated by DNA hotspots such as 
AGY and TAY (Ref. b). The human immune system is 
also ha messed .with a powerful selection mechanism so 
that on|y the cells producing the best binders survive. 
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Fig. 2. Alignment of VHH sequences with known crystal log raphic structure and of one huma rt 
va ri able domain of heavy chain (VH H Pot VH|. The hype rva riable regions are shown in red, green and 
blue. The VHH hallmark amino acids are in pink,and the Cys residues involved in either an 
intradomain ',C22 and C92I or an inter-loop disulphide bond are highlighted in yellow. The two 
sequences at the top are against a hapten (azo-dye Reactive Red, RR6|; all other VHHs are directed 
against proteins. Abbreviations: AMYL, amylase; CA, carbonic anhydrase; hCG, humanchorionic 
gonadotropin hormone; LYS, lysozyme; RN. RNase A. 

share a high degree of identity (Fig. 2) and are most 
similar (~80%) to the human VH of family III 
(Ref. 16), the most common human VH family in 
terms of both expression and genome complexity". 
The amino acids at positions that determine the 
typical immunoglobulin fold 18 are all well conserved 
in the VHH. However, four amino acids that are 
extremely well conserved in all VHs are constitutively 
substituted in the VHH. These residues [Val37Phe 
(or Tyr), Gly44Glu (or Gin). Leu45Arg (or Cys) and 
Trp47Gly (or Ser, Leu, Phe) (Fig. 2)] discriminate the 
conventional VH from the heavy-chain specific VHH. 

Three hypervariable regions can be clearly 
distinguished in the VHH sequences, although the 
average variability of the remaining partsis increased 
relative to that in human or mouse VH (Ref. 19). In 
addition, the CDR3 is longer in VHH, on average, than 
' in VHs (17, 12 and 9 amino acids in dromedary VHHs, 
hum an VH and mouse VH, respectively 1 K ) . 

Dromedary HCAbs are generated from a limited 
ri u mber of d i verse V H H germline seg m ents 

The unique, functional, heavy-chain IgG antibodies 
occur (to the best of our knowledge) exclusively within 
the Tylopoda. It is expected that their appearance must 
be paralleled by gene adaptations, and altered gene 
organization and usage. Indeed, it seems that new and 
dedicated sets of immunoglobulin genes arose in the 
common ancestor of the camelids. Separate VH and 
VHH germline genes, probably residing within the ■ 
same locus, recombine with common D and JH gene 
segments to form a VH or VHH domain, respectively 1 : 1 . 
A limited number of VHH germline genes (-40) have 
been identified in the dromedary genome: this number 
is approximately half that of functional human VH 
genes 2 ". However, the repertoire of the primary VHH 
domains is apparently further diversified by active 
somatic mutation mechanisms 19 . 

The dromedary VHH germline genes encode a 
Cys residue in CDR1 (or in the framework region at 
position 45). Cys residues at these positions are absent 



in all VH germline genes including those of the 
dromedary 1 *. In addition, a second Cys, in the VHH 
CDR3, is introduced exclusively during the 
recombination of the VHH-D-JH genes. These 
additional Cys residues form an inter-loop disulphide 
bond that stabilizes the VHH domain 21 . Furthermore, 
this bond is expected to impose conformational 
restraints on the loop flexibility in the absence of 
antigen so that the entropic penalty upon antigen 
binding is minimized. 

VHH structure 

Polymerase chain reaction and phage display are 
routin'e techniques used to clone the antigen-binding 
modules (VH-VL pairs or VHHs; Figs led) from 
antibodies and to select antigen-specific binders 2 -. It 
has been shown repeatedly that the selected VHH 
fragments can be expressed extremely well as soluble 
proteins in bacteria and yeast- :< . Several of these 
recombinant VHHs directed against haptens or 
various proteins were crystallized with or without 
their antigen. This structural information confirmed 

I 




Fig. 3. The immunoglobulin fold of a VHH (R2-VHH)". The scaffold is 
i n yellow (arrows a re p stra nds), and the CDRs 1 , 2 and 3 a re in red, 
■green and blue, respectively. The hydrophobic cluster of Phe37, Phe<S7, 
TyrS1,Trp103 and the PhelOOh (g on fig > at three amino acids upstream 
from Trp 103 a re shown in purple. The VHH hallmarkamino acids Arg4B 
and Giu44at the 'VL-side' of the VHH domain are shown in cyan and 
red, respectively. 
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that the VHH scaffold adopts the typical 
immunoglobulin fold (Fig. 3) and superimposes 
perfectly on the conventional VH structure 24 " 27 . 
Furthermore, the crystal structures clarify the 
necessary adaptations that took place in the VHH 
domain to cope with the absence of a VL domain. 
These adaptations are concentrated in two areas: the 
'side' of the domain that corresponds to the VH-side 
contacting the VL, and the CDRs themselves. 

'VLside' 

The amino acids that constitute the VH-VL interface 
consist of both conserved residues and hypervariable 
residues 1 . It is suggested that the latter modulate the 
relative position of the VH and VL, potentially altering 
the specificity of the antibody. Removing the VL 
domain exposes a large hydrophobic patch of the VH to 
the aqueous solvent. The side chains of Val37, Gln39, 
Gly44,Leu45, Trp47, Tyr91 andTrpl03, residues of the 
conserved framework, have fixed positions in all 
VH-VL pairs and provide an interacting surface of 
-700 A' (Ref. 1). As expected, the exposure of such a 
large hydrophobic region to solvent leads to 
aggregation or stickiness of an isolated VH domain- 8 . 
The VHH-specific amino acid substitutions cluster in 
this region and render the area much more hydrophilic 
than it would be otherwise. This hydrophilicity is 
augmented further by rotation of the side chains of 
adjacent residues without deforming the Co backbone. 
For example, the Trpl03 side chain rotates over its 
Cp-Cybond to expose its most polar part, the Ne. to the 




Fig . 4. (a) The structure of the single-domain VHH in complex with lysozym e (yellow). The VHH is 
shown with its complementarity determining region (CDR)l, CDR2 and CDR3 in red, green and blue 
respectively. The shape complementarity between the convex paratope and the active site of 
ly sozyme is striking. (b( A Ca representation of the structure of the azo-dye Reactive-Red |RR6)-binder 
in complex with its hapten (space fillings). The cavity between the CDR1 (red) and CDR2 (green) 
providesa pocket for the hapten. The CDR3 (blue) serves mainly to cover thB 'VL-slde' of the VHH 
domain. The presence of the two copper ions, Cu 1 and Cu2, is indicated. 



environment. Phe37 fills a hydrophobic pocket created 
.by the side chains of Phe47, Tyr91, Trpl03 and the 
CDR3, where the Phe residue three amino acids 
upstream of Trpl03 plays a central role (Fig. 3). In 
addition, the VHH CDR3 folds over this part of the 
domain, and covers some of the amino acids that are 
buried by the VL partner in a typical VH-VL dimer. 
This reshaped surface explains both the failure of a 
VHH to associate with a VL domain and the increased 
solubility of an isolated VHH domain* 9 . 

Riechmann described the nuclear magnetic 
resonance structure of a partial 'camelised' human 
VH (i,e. a few amino acids were substituted into 
framework two of a human VH to mimic a camel VHH ; 
Ref. 30). Mutation of the Leu45 into Arg (and Trp47 
into He to improve expression levels) rendered the 
isolated human VH domain more soluble. However, 
these mutations induced unexpected backbone 
deformations at positions 37-38 and 45-47. and the 
side chain of Trpl03 took a completely new position. 
Thus, it appears that the backbone scaffold of the 
original VH and VHH are superimposable. whereas 
the partial 'camelisation' of a human VH by Ile47Gly 
and Val37Phe mutations introduces bulges and 
side-chain deformations at adjacent p strands. 

Antigen-binding loops 

In the absence of the VH-VL combinatorial diversity, 
new mechanisms have to be introduced to increase 
the diversity of the antigen-binding loops within one 
domain. In addition, there needs to be compensation 
for the loss of antigen-interacting surface contributed 
by the hypervariable loops of the VL. Apparently, this 
has been achieved largely because of distinct 
differences in the organization of the hypervariable 
loops. First, the CDRl of VHHs is extended towards 
the N-terminal end (Fig. 2). These amino acids form a 
loop connecting two (5 strands of adjacent sheets in the 
immunoglobulin domain. In VHHs this region is more 
variable in sequence than in VHs, probably because of 
the acquisition of somatic mutations that are selected 
during the affinity maturation process. Indeed, DNA 
mutational hotspots (i.e. DNA sequences that are 
more susceptible to mutation* 1 ) are imprinted in this 
region of the VHH germline genes but not in VHs 
(Ref. 19). and the amino acids encoded by these 
hotspot sequences were proven to interact directly 
with antigen-- 1 -\ Second, the conformation of the 
CDRl and CDR2 in a VHH often deviates from 
canonical structures in human or mouse VHs 
(Ref. 32). However, there is no a priori reason why the 
CDRl and CDR2 of conventional VHs could not adopt 
these alternative conformations. Third, the CDR3 of a 
VHH is. on average, longer than that of VHs. and is 
also moreaccessible to solvent, thus creating a larger 
surface area available for antigen interaction. 

VHH-antigen binding 

The presence of an enlarged CDRl and CDR3 in 
VHHs. and loops that exhibit alternative canonical 
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structures, increases the structural repertoire of the 
antigen-binding site in the single-domain VHH and 
compensates for the absence of the three VL CDRs. 
Altogether, the architecture of the antigen-binding site 
(paratope), and the antigen interaction mode, of VHHs 
are very diverse. As expected, planar paratopes are 
observed to interact with a proteinaceous antigen. But 
surprisingly, in an RNase A-binder, the contacts are 
only made with two loops 24 , CDR1 and CDR3, whereas 
in the carbonic anhydrase-binder only one single loop, 
CDR3, is involved in antigen interaction. In another 
example, ten consecutive amino acids of the CDR3 
protrude from the antigen-binding site and penetrate 
into the active site of lysozyme where they mimic the 
natural substrate of the enzyme 33 (Fig. 4a). Such a 
striking formation of a large convex paratope by a 
protruding CDR loop has not yet been observed in 
conventional antibodies. Furthermore, it provides an 
antigen-VHH interface area of -1700 A a , which is as 
large as the interface between antigens and a VH-VL 
pair 34 . Finally, a third paratope architecture was 
observed for a hapten binder. Haptens are normally 
captured in a groove or cavity at the VH-VL 
interface 3 ' 6 . Despite the absence of the VL, a llama 
VHH was able to form a cavity with its three CDRs to 
accommodate the hapten 27 (Fig. 4b); however, the 
importance of CDR3 in the interaction was reduced. 

Concluding remarks 

The recombinant VHH is a minimal-sized, intact 
antigen-binding domain derived from in vivo matured 
camel or llama heavy -chain antibodies. It is 
extremely stable and binds antigen with affinities in 
the nanomolar range. The absence of a VL domain 
permits the VHHs many structural variations that 
are not permissible in VH domains associated with 
VLs. Some of the amino acids within the VH CDRs 
participate in the VH-VL contact, and mutations at 



these spots are consequently forbidden or at least 
limited. Such a restraint is not an issue for the amino 
acids within the CDRs of VHHs. The subtle 
substitutions and structural rearrangements that 
occur between the VL-interacting surface of a VH and 
the corresponding side of the VHH provide an ' 
example of natural protein engineering converting a 
heterodimeric protein into a monomeric protein. 

The absence of VL-antigen contacts is 
compensated for in a VHH by an extended CDRl and 
a more exposed CDR3, The structural repertoire of 
the antigen-binding site of VHHs is therefore diverse, 
and new canonicalstructures have been identified. 
Besides the standard architectures such as cavities 
and planar surfaces, the antigen-binding site of VHH 
also includes protruding loops. Thus, a monomeric 
domain that interacts with antigens also has 
advantages. Indeed, the absence of the VL domain 
means that the paratope is concentrated over a 
smaller area so that small, hidden epitopes can still 
be targeted. It seems that heavy-chain antibodies or 
conventional antibodies recognize different antigenic 
sites. For example, in contrast to conventional 
antibodies, the camel heavy-chain antibodies interact 
preferentially with the active site of enzymes 35 . Hence 
the VHHs directed against enzymes often appear to 
be potent inhibitors. The design of small enzyme 
inhibitors derived from the antigen-binding site of the 
VHHs will be simplified owing to the reduced 
complexity of the VHH paratope, which will have 
three instead of six antigen-binding loops. However, it 
is expected that single-domain antibodies of camelids 
will find their way into many more biotechnological 
applications, especially whenever large quantities are 
needed at low cost, and when immunofusions, 
immunomodulation or robust molecular recognition 
units in affinity adsorbents, or in antibody -based 
protein chips, are envisaged 36 ' 38 . 
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Interactions between prion protein 
isoforms: the kiss of death? 



Byron Caughey 



Direct interactions between the normal and aberrant forms of prion protein 
appear to be crucial in the transmission and pathogenesis of transmissible 
spongiform encephalopathies (TSEs) or prion diseases. Recent studies of such 
interactions in vitro have provided mechanistic insight into how TSE-associated 
prion protein might promote its own propagation in a man Iner that is specific 
enough to account, at least in part, for TSE strains and species barriers. 



In its normal state, prion protein (PrP) seems 
innocent enough - a cell-surface glycoprotein that is 
present in most mammalian tissues. Occasionally, 
however, things go wrong with this protein and it 
accumulates in abnormal forms that cause 
devastating neurodegenerative diseases called ' 
transmissible spongiform encephalopathies (TSEs) or 
prion diseases 1 ' 2 . TSE diseases include scrapie of 
sheep and goats, bovine spongiform encephalopathy 
(BSE or mad cow disease), human Creutzfeldt- Jakob 
disease (CJD), and chronic wasting disease (CWD) of • 
deer and elk. A key feature of these diseases, besides 
the accumulation of abnormal PrP isoforms, is their 
transmissibility. Although the infectious agent, or 
prion, is not fully understood 3 , substantial evidence 
suggests that it requires anabnormal, usually ' 
protease-resistant, form of PrP(Refs 2,4). Thus, 
Stanley Prusiner's proposal 2 that an abnormal form 
ofPrP is the main component of the infectious agent 
remains a predominant, but controversial, Working 
hypothesis in the TSE field. 

J.S. Griffith originally proposed that a protein alone 
could serve as the infectious TSE agent if it were an 
abnormal, pathogenic form of a host protein that could 
induce its normal counterpart to convert to the abnormal 
form 6 . Since the discovery of PrP, evidence has mounted 
in favor of the importance of precise interactions 
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between the normal and aberrant forms of PrP in TSE- 
agent propagation, transmission and pathogenesis 1 - 4 6 . 
Exciting recent advances in the understanding of TSE 
neuropathogenesis have been reviewed elsewhere 1 . 
This review will focus on the molecular interactions 
between the different PrP isoforms and how they 
might relate to TSE pathobiology. 

PrP isoforms: normal versus abnormal 

Normally, in its mature form after removal of N- and 
C-terminal signal sequences, cellular PrP (PrP 0 ) 
contains -210 amino acid residues, two Asn-linked 
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Fig. 1. Model of PrP c structural domains.The folded C-terminal portion 
of PrP c thatcontains the short p-sheet strands (yellow arrows! and the 
a helices (pink! is based on a model derived from the nuclear magnetic 
resonance (NMRt-based coordinates of residues 1 24-226 of hamster 
PrP {Ref. 36). The remainder of the molecule appears, by NMR. to be 
flexibly disordered. Abbreviations: GPI, glycophosphatidylinositol 
moiety; PrP c , cellular prion protein. 
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Single domain antibodies: comparison of camel VH and 
camelised human VH domains 
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camel heavy chain-only andbodies and VH domains from conventional human antibodies helped to design an altered human 

S£r m ' i S kT /rl , Pt0V t ' CamCl VH ' t0 bC a Sma11 ' I0bust "* efficient unit formed by a 

single immunoglobulin (Ig) domam. Biochemical, structural and antigen binding characterisation properties of both camel 
VH domains and camelised human VH domains suggest that these can compete successfully with single chain variable 
uZ; I fragments frora ct ™tional antibodies in many applications. Of special importance in this respect is the use of 
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Antibodies have long been considered as a power- 
ful tool to recognise and target almost any molecule 
with a high degree of specificity and affinity. For 
such purposes, natural antibo"dies can be obtained as 
mixtures in the form of antisera from immunised 
animals (Cohn et al., 1949) or as monoclonal anti- 
bodies from hybridomas (Kohler and Milstein, 1975). 
More recently, recombinant DNA technology has 
allowed the cloning and genetic manipulation of 
antibody genes, which can then be expressed as 
recombinant antibodies in eukaryotic cells (Boulianne 
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et al., 1984; Neuberger et al., 1985). To obtain good 
yields of active antibody protein also from bacteria, 
it was necessary to change the antibody format from 
the full-length molecule, consisting of two multido- 
main heavy and light chains each, to smaller versions 
like the antigen binding fragment (Fab), or variable 
domain (Fv) fragment (Better et al., 1988; Skerra 
and Pluckthun, 1988). These fragments all contain 
ordinary antigen binding sites formed by a single 
pair of N-terminal variable domains from heavy and 
light chain, but they contain no (Fv) or only one 
constant domain (Fab) per chain. Apart from en- 
abling better expression in bacteria, these smaller 
formats have other advantages as they, for example 
result in a better distribution and faster clearance 
than larger antibody molecules when used in vivo 
(Yokota et al., 1990). Their smaller size makes them 
also more suitable for structural studies like nuclear 
magnetic resonance (NMR) spectroscopy (McManus 
and Riechmann, 1991; Riechmann et al., 1991). 

It was therefore attempted to create even smaller 
fragments of antibodies with adequate antigen bind- 
ing activities. To decrease size significantly below 
that of Fv fragments, such minimal recognition units 
have to be based on single domains. And, indeed 
already very early experiments indicated that anti- 
body heavy chains , can occasionally bind antigens in 
the absence of their light chain partner (Utsumi and 
Karush, 1964). These results were corroborated, when 
single heavy chain variable (VH) domains were iso- 
lated from bacterial expression libraries of heavy 
chain variable regions from immunised mice (Ward 
et al., 1989). 



chain antibodies could be separated from the conven- 
tional heterotetrameric antibodies by differential ad- 
sorption on proteins A and G. Later on, the sequence 
of cDNA clones of spleen or blood lymphocytes 
revealed the presence of three or four different heavy 
chain antibody isotypes in dromedary and llama 
respectively (Vu et al., 1997). These cDNA se- 
quences lack the exon coding for the first constant 
domain. The heavy chain polypeptide of heavy chain 
antibodies is therefore composed of the variable 
domain, immediately followed by the hinge, CH2 
and CH3 domains (Fig. 1). The absence of the CHI 
domain explains the absence of the light chain in the 
heavy chain-only antibodies, as this domain is the 
anchoring place for the constant domain of the light 
chain (Padlan, 1994). Besides the presence of im- 
munoglobulin (Ig) heavy chains without CHI the 
analysis of the cDNA clones revealed that sera of 
camehds contain also two different 7-isotypes with a 
CHI domain (Vu et al., 1997). Evidently, these latter 
encode the heavy chains of the conventional antibod- 
ies within the Camelidae. 

A pathological disorder in humans or mice, known 
as heavy chain disease, occurring in humans or mice 
is characterised by the presence of heavy chain 
antibodies in their sera (Seligmann et al., 1979) 
TTiese truncated antibodies result from a 'somatic 
event that removes various parts of the VH and CHI 
region from the expressed Ig gene. The mouse or 
human heavy chain antibodies however are not func- 
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2. Camel heavy chain antibodies 

While these results suggested that it might well be 
possible to generate single domain antibodies in 
vitro, it was then discovered that heavy chain-only 
antibodies had also evolved in vivo. In camels and 
other camelid species, a significant proportion of the 
natural antibody repertoire were found to consist of 
antibodies lacking a light chain partner (Hamers-Cas- 
terman et al., 1993). These antibodies have a molec- 
ular weight of ~ 95 kDa instead of the ~ 160 kDa 
for conventional antibodies (Ungar-Waron et al 
1987). At least, two different fractions of heavy 




Fig. 1. Schematic picture of a conventional antibody (human 
•y-isotype) and a camel heavy chain antibody. The antigen binding 
site forming variable domains, which form either an Fv fragment 
in case of the conventional antibodies or a single domain antibody 
in case of the camel heavy chain antibodies, are shaded. 
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tional in antigen binding. In sharp contrast, the blood 
sampling of infected or immunised dromedaries 
showed that the immune response contained a di- 
verse repertoire of heavy chain-only antibodies, 
which were functional in antigen binding (Hamers- 
Casterman et al„ 1993). 



3. Camelising human VH domains 



:e analysis suggested that the camelid VH 
s may contain a significantly altered surface 
m that region, which in VHs from conventional 
antibodies forms the VH/light chain variable do- 
main (VL) interface (Chothia et al., 1985; Muylder- 
mans et al., 1994). The antibody VH gene family, 
which is overall most homologous to VHs from 
camel heavy chain-only antibodies, is the human 
VH3 family (Muyldermans et al., 1994; Nguyen et 
al., 1998). Their framework (FR) sequences are very 
similar except for three residues in FR2 (Fig. 2) 
which are highly conserved in VH domains from 
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most conventional antibodies. These residues (G44 
L45, W47 in the human VH3; E44, R45, G47 iii 
most camel VHs) are located in the VH/VL inter- 
face and the side chains of residues 45 and 47 point 
towards the VL in structures of conventional anti- 
bodies (Chothia et al., 1985). 

To analyse the effect of these mutations, a human 
VH3 was expressed as an isolated domain in Es- 
cherichia coli and the three mutations were intro- 
duced in its former VL interface (Davies and Riech- 
mann, 1994). The original human VH and two 
camelised mutants (G44E, L45R, W47I or W47G) 
were all found to be monomelic at low protein 
concentrations. However, the original human VH, in 
the absence of a VL domain, started to aggregate at 
higher protein concentrations (1 mg/ml and above) 
causing severe line broadening in NMR analysis 
(Davies and Riechmann, 1994). Aggregation was 
significantly reduced for the camelised VH domains. 
Thus, the camelised human VH domains had a much 
improved linewidth (transverse proton relaxation time 
T 2 was increased from 14.5 up to 29 ms) in NMR 
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experiments, which enabled their structural analysis 
in solution (Davies and Riechmann, 1994; Riech- 
mann, 1996; Riechmann and Davies, 1995). A very 
limited number of mutations in the former VL inter- 
face had therefore made it possible to create a human 
single VH domains, which behaved well in solution. 

Apart from enabling me NMR analysis of the VH 
domain, this improvement opened the possibility to 
create single domain recognition units based on hu- 
man heavy chain variable domains. Such camelised 
human VH domains may turn out to be more suitable 
than camel VH domains for any future in vivo use. 
They further carry, in case of the human VH3 do- 
main, the additional advantage of being recognised 
by the bacterial superantigen protein A, which facili- 
tates highly specific and efficient purification. Pro- 
tein A binds to a nonlinear epitope within the VH 
(Riechmann and Davies, 1995) and thereby will 
result in purification of correctly folded VH protein 
only. 

4. Preparation of specific camel heavy chain anti- 
bodies 

The immunisation of llamas and dromedaries fol- 
lowing standard protocols (complete and incomplete 
freund adjuvant; three to four boosts with 50 |xg to 1 
mg immunogen per animal) generates specific heavy 
chain-only antibodies of good titres. Approximately 
0.1 mg of polyclonal heavy chain-only antibodies 
with specificity for a-amylase could be trapped by 
batch adsorption from 1 ml serum of an immunised 
dromedary (Lauwereys et al., 1998). 

The polyclonal heavy chain-only antibodies of the 
IgG3-type purified on proteins A and G can also be 
used as a source to isolate variable domains. These 
variable domains are then obtained after a limited 
proteolytic digestion of IgG3 with endo-Glu V8 pro- 
tease. This enzyme cleaves the short hinge region 
between the VH and CH2. Protein A chromatogra- 
phy retains the Fc containing fragments and a sub- 
fraction of the camel VHs. The flow-through con- 
tains the majority of the VH domains, of which 
measurable amounts could recognise the antigen 
(Lauwereys et al., 1998). 

To obtain recombinant forms of camel VH do- 
mains, the repertoire of heavy chain-only antibody 
variable domains from an immunised camel are 



cloned in bacteria (Ghahroudi et al., 1997). Expres- 
sion libraries of such VH repertoires can be screened 
for the presence of antigen specific binders. To avoid 
the contamination with VH genes originating from 
conventional antibodies a two-step polymerase chain 
reaction (PCR) was proposed (Ghahroudi et al., 
1997). In the first PCR on cDNA template of blood 
lymphocytes a set of primers, which anneals at 
the first codons of the VH and within the CH2 
region, were used to amplify all -y-isotypes. The 
PCR products originating from the heavy chain-only 
antibodies could be selectively eluted after gel elec- 
trophoresis since the fragment of the heavy chain- 
only antibody -y-isotypes are ~ 350 nucleotides 
shorter due to the absence of the CHI exon. The 
entire VH region is then reamplified with nested 
primers annealing at the codons of FR1 and FR4 of 
the VH, respectively. The resulting PCR product is 
finally ligated in a phage display vector (see below) 
adapted to clone a VH region only. A VH library of 
10 6 -10 7 individual clones, from which typically sev- 
eral antigen binders were selected, can be obtained 
from as little as 5-10 ml blood. The whole proce- 
dure from the first immunisation to the identification 
of binders can be performed in less than 3 months. 
Following this procedure, more than 20 binders 
against a variety of proteinaceous antigens were 
isolated (Table 1). These VHs bound their antigen 
specifically with dissociation constants from 100 nM 
into the subnanomolar range (Muyldermans and 
Lauwereys, 1999). This affinity is similar to that 
measured for conventional antibodies from a sec- 
ondary immune response (Foote and Milstein, 1991). 

Although phage display or expression libraries for 
Fv or Fab fragments of conventional antibodies can 
also be prepared from blood of immunised or in- 
fected mice and humans (Persson et al., 1991; Win- 
ter et al., 1994), the single domain VH libraries of 
immunised camels have several advantages. First, all 
the variable domains of camelid heavy chain anti- 
bodies belong to one single family (VH3). Such a 
VH library can therefore be generated with a single 
set of PCR primers. Secondly, the entire paratope is 
located on a single domain present in a single exon 
that is cloned as one entity. In contrast, the VH 
genes of mouse and human antibodies belong to 
several families so that multiple PCRs with different 
sets of primers are necessary to clone the complete 
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Table 1 

Affinities of camel VH domains and camelised human VH do- 
mains 

Data for the camel VH are from Ghahroudi et al. (1997), Lauw- 
ereys et al. (1998) and K. Conrath (personal communication), date 
for the camelised human VHs are from Davies and Riechmann 
(1996b). The antigen used were ct-amylase, carbonic anhydrase, 
p-lactamase, 4-glycyl-2-phenyloxazoI-5-one (OxGly), 3-iodo-4- 
hydroxy-5-nitrophenyI-acetyl caproic acid (NlP-CAP), the HTV 
transcription factor rev and hen egg lysozyme. The length of the 
H3 region refers to the number of residues between 95 and 102 
inclusive using the Kabat nomenclature (Kabat et al., 1991). 



Ligand K d (nM) H3-length 



Camel VH 








cAb-AMD7 


a-amylase 


15 


16 


cAb-AMD9 


oc-amylase 


3.5 


14 


cAb-AMBlO 


ot-amylase 


24 


14 


CAB-CA04 


carbonic anhydrase 


29 


18 


cAb-CA06 


carbonic anhydrase 


20 


13 


cAb-Lys2 


lysozyme 


2 


19 


cAb-Lys3 


lysozyme 


65 


24 


cAb-j31a01 


j3-Iactamase 


<l 


21 


cAb-(3Ia02 


P-lactamase 


1.5 


S 


cAb-pla03 


(3-lactamase 


10 


17 


Camelised VH 








VH-Ox21 


OxGly 


146 


15 


VH-OSI 


OxGly 


25 


15 


VH-Ox21.2.4 


OxGly 


47 


15 


VH-Ox62 


OxGly 


267 


15 


VH-N3cl 


NIP-CAP 


292 


15 


VH-N3cI.2.2 


NIP-CAP 


31 


15 


VH-REVgl 




220 


11 


VH-REVrsl 




401 


11 


VH-LS2 


lysozyme 


3100 


10 


VH-LS2.S.1 


lysozyme 


1600 


10 



repertoire. In addition, the Fv has to be reconstituted 
from the combination of a VH and a VL domain. 
These, must be amplified as separate gene fragments, 
which are randomly combined afterwards. Therefore, 
original pairs of VH and VL domains with antigen 
binding activity that were matured during the im- 
mune response as one entity can only be recovered 
by random combination requiring Fv libraries of 
relatively large size, which at times approaches the 
limitations of bacterial transformation efficiency. 

5. Preparation of specific camelised VH domains 

While camel heavy chain-only antibodies of a 
desired antigen specificity can be rescued from natu- 
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ral sources, camelised VH domains are synthetic and 
must therefore be generated in vitro. For this, the 
human VH3 domain with the camelised VL interface 
(i.e., with mutated residues 44, 45 and 47) was used 
as a building block for the creation of designed 
libraries. VH repertoires can most simply be created 
through the introduction of randomised regions into 
the VH gene to vary the hypervariable loops both in 
length and amino acid residue nature. The three 
hypervariable loops of VH and VL form the antigen 
binding site in ordinary antibodies (Chothia et al 
1989). 

To facilitate selection of a repertoire of camelised 
VH domains, these were displayed, like other anti- 
body fragments before, on filamentous phage by 
fusion to the N-terminus of the minor phage coat 
gene 3 protein (g3p) on the gene level (Davies and 
Riechmann, 1995a). Each phage particle displays its 
individual VH, . which is encoded in the encapsulated 
phage genome. VHs can therefore be selected, for 
several rounds if needed, through panning of phage 
on immobilised antigen. Bound phage is rescued and 
regrown through infection of bacteria. Selected VHs 
can be analysed (and indeed be used for detection or 
targeting) for their antigen specificity and affinity as 
phage displayed VH or as soluble VH. Soluble VH 
can be produced after either subcloning into a solu- 
ble expression vector or through the use of bacterial 
suppresser strains and appropriate stop codons be- 
tween the VH fusion and the phage g3p (Hoogen- 
boom et al., 1991). 

Repertoires of camelised VH domains were ini- 
tially created by randomisation of residues within the 
third hypervariable loop H3, which at the same time 
was varied in length. In VH domains from conven- 
tional antibodies and indeed camel heavy chain-only 
antibodies, this loop contains among the three VH 
hypervariable loops the highest diversity in length 
and amino acid nature (Kabat et al., 1991; Wu et al., 
1993). From such a repertoire of 2 X 10 s clones, 
camelised VH domains specific for hapten, peptide 
and protein antigens were selected (Davies and 
Riechmann, 1995a, 1996a; Martin et al., 1997). The 
dissociation constants for these VH domains in their 
soluble form and their respective antigens were in 
the nanomolar to micromolar range (Table 1). The 
affinities could be improved up to tenfold by phage 
selection of VHs containing secondary mutations 
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within the other two hypervariable loops (Davies and 
Riechmann, 1996b). 

These results show that camelised VH domains 
present a useful building block for the preparation of 
Ig-based recognition units of minimal size. Selected 
VH domains proved to be highly specific and of 
reasonable affinity, which can be increased through 
the preparation of larger repertoires or secondary 
mutations. 

The same experiments, however, also made obvi- 
ous that some properties of camelised VH domains 
can be improved upon. These are related to the 
former VL interface of the VH. The three camelising 
mutations (G44E, L45R and W47G), which in- 
creased the solubility of the single human VH do- 
main most and lead to the least nonspecific binding 
of phage displayed VH domains, compromised the 
stability and most significantly the expression yield 
of active protein for the resulting soluble VH (Davies 
and Riechmann, 1994, 1995a). Camelised VH do- 
mains, which had been selected with a glycine at 
position 47, were therefore expressed as G47I mu- 
tants in their soluble version. This had in most cases 
minor or no effects on antigen affinity. The G47I 
mutation increased in all cases stability and purifica- 
tion yields of active VH protein significantly. Unfor- 
tunately, constitutive use of isoleucine at position 47 
of the VH for phage display lead to an unacceptably 
high number of nonspecificaUy binding clones in the 
repertoire, which compromised selection (Davies and 
Riechmann, 1995a). 

Further mutations in the camelised, human VH 
domain appear to be necessary to create a prototype 
FR, which exhibits low nonspecific binding and high 
expression yields at the same time. Thus, it was 
indeed possible to improve protein stability of 
camelised VH domains significantly through the in- 
troduction of a new intradomain disulphide bridge 
between cysteines at positions 33 (in FR2) and 100b 
(in H3), which could even be constitutively intro- 
duced in repertoires of camelised VH domains 
(Davies and Riechmann, 1996a). 

Another problem concerns the multimerisation 
state of the selected VH domains. While most se- 
lected camelised VH domains were monomelic, 
dimeric domains also were occasionally selected 
(Martin et al., 1997). This obviously depends on the 
nature of the H3 loop as this is the only portion of 



the camelised VHs, which is different among the VH 
domains in the original repertoire. 

Concerning the generation of diversity within 
repertoires, increased variation both in length and 
sequence of the H3 loop in camelised VH domains is 
probably the most effective and simplest method. 
Sequences from antigen specific, natural camel heavy 
chain antibodies after immunisation suggest that the 
H3 loop has an even more central role for antigen 
binding in heavy chain-only antibodies than it has in 
conventional antibodies (Muyldermans et al., 1994). 
Repertoires based on completely randomised H3 
loops can be further diversified through a limited 
number of mutations in the other two hypervariable 
loops. 



6. Expression of recombinant heavy chain-only 
antibodies and single VH domains 

6.1. Phage antibody fragments 

For the phage display of antibodies, these are 
typically fused as Fab or single chain Fv (scFv) 
fragments to the phage g3p protein. Once a specific 
antibody clone has been selected, this can be used 
for many applications directly in this phage format 
(Nissim et al, 1994). For example, selected phages 
can be most easily screened from supernatants of 
infected bacteria in enzyme-linked immunoadsorbent 
assay (ELISA) for antigen binding, where they are 
detected with an anti-phage antibody (e.g., anti-M13 
monoclonal antibody; Pharmacia). This essay is 
highly sensitive as the phage coat contains about 
2700 copies of the major phage coat protein gene 8 
protein (Model and Russel, 1988), which forms the 
epitope for the anti-phage antibody. When a pure 
phage system (rather than a phagemid in combina- 
tion with a helper phage) is used to provide the 
g3p-fusion, the phage contains up to five copies of 
the antibody fragment resulting in avidity effects, 
which will allow the detection of even very weak 
binders. 

Concerning the use of phage displayed antibody 
fragment, there is no difference between ordinary 
heavy and tight chain antibody fragments and single 
chain polypeptide antibodies like the camel heavy 
chain-only antibodies or camelised human VH do- 
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mains. VHs from camel heavy chain antibodies and 
the camelised human VH domains have both been 
successfully fused to g3p (Davies and Riechmann, 
1995a; Ghahroudi et al., 1997) and were suitable for 
detection of their antigens in ELISA. Their use in 
other application, like histological staining or cell 
sorting, has not been tried, but there is no reason 
why they should perform any worse or better than 
ordinary antibodies. 

6.2. Soluble antibody fragments of camel heavy 
chain-only antibodies 

The genes for camel VH domains can be inserted 
by cassette mutagenesis into suitable expression vec- 
tors to produce larger multidomain or multifunc- 
tional proteins. For example, an intact heavy chain- 
only antibody was readily generated by cloning a 
particular camel VH in front of the hinge and effec- 
tor function domains of human IgGl. This construct 
- can be expressed in bacteria, but expression in mam- 
malian cell lines will add the proper glycosylate at 
the CH2 domain. Such constructs in a pcDNA3 
vector (Invitrogen) produced after transient expres- 
sion in COS cells some 5 mg/1 medium (K.B. Vu, 
personal communication). These chimeric heavy 
chain antibodies were fully active in antigen binding. 

Multimerised forms of camel VHs linked on a 
single gene through a peptide were also obtained 
from bacteria yielding multivalent recognition units 
(K. Conrath, personal communication). Two camel 
VHs directed against different antigens should form 
bispecific recognition units when linked on the gene 
level. Equivalent constructs have been made using 
conventional scFv as molecular building blocks (Neri 
et al., 1995). However, the single domain nature of 
the camel VH will most likely yield higher func- 
tional expression levels. 

We are convinced that camel VH will be most 
useful in diagnostic applications where stable and, 
small reporter molecules are required. Camel VHs 
can routinely be expressed with a His tail in the 
bacterial periplasm. The average purification yield 
obtained after purification by immobilised metal 
affinity chromatography and gel filtration is between 
0.5 and 5 mg/1 culture. These recombinant camel 
VHs do not dimerise and can be concentrated to 10 
mg/ml without aggregation. The VHs are stable, 
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they resist incubations at 37°C for 1 week and T 
values between 60°C and 72°C were measured 
(Ghahroudi et al, 1997). 

6.3. Soluble camelised human VH domains 

Camelised VH domains, which were selected from 
phage display libraries, have been expressed as solu- 
ble VH domains after subcloning into E. coli expres- 
sion vectors directing folded protein to the periplasm 
through fusion to a suitable leader peptide (Davies 
and Riechmann, 1995a). Camelised VHs containing 
a glycine at position 47, which was most success- 
fully used for selection after phage display, yielded 
up to 1 mg functional protein from 1-1 bacterial 
culture grown in shaker flasks using protein A 
sepharose for purification from both supernatants or 
periplasmic preparations. The same VH domains, 
expressed with an isoleucine at position 47, yielded 
up to 10 mg active protein from 1 1 of culture. The 
same was usually the case even when the VHs 
contained two additional cysteines (Davies and 
Riechmann, 1996a). ScFvs rescued from phage dis- 
play libraries or indeed engineered from monoclonal 
antibodies have often similar purification yields 
(Skerra, 1993), which however can be highly vari- 
able depending on the particular scFv used. This is 
due to the wider structural variety of scFvs, while 
most camelised VHs, which are identical except for 
their H3 loop, give purification yields in the dis- 
cussed range. In addition, expression of both camel 
VH domains and camelised human VH domains 
is not compromised by the presence of a linker 
between VH and VL as in a scFv. Such linkers can 
interfere with folding and can be susceptible to 
proteolysis (Whitlow et al., 1993). 

Concerning the stability of the camelised VH 
domains, these are usually considerably higher than 
those of scFv or ordinary Fv fragments, which can 
fall more easily apart due to their two-domain archi- 
tecture. The camelised VH domains containing a 
glycine at position 47 have melting points of about 
60°C, while those with an isoleucine at position 47 
have a T m of about 70°C (Davies and Riechmann, 
1995b, 1996a). Other mutations, like the change of 
valine 37 to phenylalanine and the introduction a 
second intradomain disulpbide bond between newly 



Riechmann, S. Muyldermans /Journal of Immunological Methods 231 (1999)25-38 



introduced cysteines at positions 33 and 100b, which 
were also adapted from natural camel VH domains, 
increased the thermostability of the VH domains to 
almost 80 6 C (Davies and Riechmann, 1996a). Again, 
as with purification yields, stabilities of scFvs are 
much more variable due to their more diverse nature 
of underlying VH and VL genes in addition to the 
variable lability of the VH/VL interaction. 

Other features of camelised VH domains are less 
straightforward to compare with those of other for- 
mats of antibody fragments. Thus, camelised VH 
domains with long third hypervariable loops were 
occasionally cleaved by bacterial proteases. Al- 
though this may also be occasionally observed in 
scFvs, it will be less frequent in the case of conven- 
tional antibody combining sites as these contain usu- 
ally shorter H3 loops. 

Another observation is the formation of dimeric, 
camelised VH domains (Martin et al., 1997). Dimeri- 
sation is also not unknown for scFvs. In the case of 
scFvs, it was possible to turn that into a feature 
rather than a problem. Through the use of short 
VH/VL linkers, dimerisation (and even trimerisa- 
tion) can even be enforced and can lead to so-called 
diabodies with two identical binding sites, or even 
different specificities when VH and VL domains 
from two different scFvs are linked on the same gene 
(Holliger et al., 1993). In the case of single VH 
domains, dimerisation through covalent linkage of 
two different VHs should also lead to avidity effects 
or to the creation of dual specificity. However, such 
modifications will compromise the two most impor- 
tant advantages of VH domains, which are small size 
and high stability. 

As far as aggregation of camelised VH domains 
compared to other antibody fragments is concerned, 
this will very much depend on the nature of the 
particular fragment. Most camelised VH domains or 
Fv and Fab fragments do not aggregate, however, 
occasionally fragments are prone to aggregation de- 
pending on particular sequences within the proteins. 



7: Structures of camel VH domains 

The crystal structures of one llama and two 
dromedary VH domains are available in the protein 
data bank. The llama VH with specificity for human 



chorionic gonadotropin was solved to 1.8 A resolu- 
tion (pdb-file 1HCV) (Spinelli et al., 1996). The 
dromedary VHs, cAb-Lys3 (pdb-file 1MEL) cAb- 
RN05 (pdb-file 1BZQ) and cAb-RN05 (pdb-file 
1BZQ), were crystallised with their respective anti- 
gens, hen egg-white lysozyme and bovine RNase A 
(Desmyter et al., 1996; Decanniere et al., 1999; Fig. 
3). In all three cases, the Ig fold of the VH is well 
preserved. Two 3-sheets (one with four and one with 
five p-strands) are packed against each other and 
stabilised by a conserved intradomain disulphide 
bond between C22 and C92. The rms deviation 
between the core of any two camel or llama VH and 
human VH (of family 3) is between 0.27 and 0.69 A 
(Decanniere et al., 1999). 

The side of the camel VH domain corresponding 
to the VL interacting face of the normal VH in an Fv 
has a quite different architecture. Compared to the 
human VH, four amino acid substitutions (V37F 
G44E, L45R and W47G or W47S) are located in this 
region. The substitutions at positions 44, 45 and 47 
were those, which were used to camelise the human 
VH and rendered the isolated domain more soluble 
(Pavies and Riechmann, 1994). The nonpolar to 
polar amino acid substitutions (G44E and L45R) 
increase the hydrophilicity of the surface. The substi- 
tutions at positions 37 and 47 cause a net shift of the 
bulky hydrophobic groups. In case of the two camel 
VHs with a known structure, the H3 loop folds over 
these residues and makes them solvent inaccessible 
(Decanniere et al., 1999). 

From a survey of all human and mouse VH 
antigen binding loop structures, it became apparent 
that only a restricted number of possible conforma- 
tions were encountered (Chothia and Lesk, 1987; 
Chothia et al., 1992; Al-Lazikani et al., 1997). Three 
and four different conformations are described for 
the first and second antigen binding loop, respec- 
tively. These so-called canonical structures are deter- 
mined by the length of the loop and the presence of 
particular residues at key positions. The H3 loop is 
extremely variable in length and sequence (Wu et al., 
1993). Consequently, the prediction of the loop ar- 
chitecture remains more speculative despite recent 
progress (Martin and Thornton, 1996; Shirai et al., 
1996; Morea et al., 1998). Surprisingly, the antigen 
binding loop structures of camel VHs deviate from 
the present canonical loop definitions of human and 




mouse VHs. This deviation could not be predicted, 
because the loop length and the residues at the key 
positions are very similar between camel VH and 
human VH. Consequently, the prediction of the camel 
VH hypervariable loop structures by the current 
canonical loop structure algorithms is not reliable. 
The additional canonical loop structures in camel 
VHs make the structural repertoire of their paratope 
larger than that of VH domains in Fv fragments from 
conventional antibodies. 

Moreover, the hypervariable region around the 
first antigen binding loop in VHs of camel heavy 
chain-only antibodies is enlarged to cover residues 
26 to 35 (Vu et al., 1997). In human VHs, the first 
hypervariable region only comprises positions 31 to 
35 (Kabat et al., 1991). This indicates that residues 
26 to 30 in camel VHs may more frequently partici- 



pate in antigen binding. Indeed residue 29 (129) of 
cAb-Lys3 interacts directly with the antigen lysozyme 
(Desmyter et al., 1996) and the hydroxyl group of 
Y27 of cAb-RN05 is hydrogen bonded with its 
RNase A antigen (Decanniere et al., 1999). The first 
amino acid of conventional VHs, which usually in- 
teracts with antigen, is located at position 30 
(Torrdinson et al., 1996). The extension of the first 
hypervariable region in camel VHs and a concomi- 
tant enlarged antigen binding surface compared to 
that of a VH in a conventional antibody appears to 
compensate in part for the absence of a VL domain. 

However, the antigen binding surface of cAb- 
RN05 remains small (570 A 2 ) compared to Fv 
paratopes interacting with proteins (613-841 A 2 ) 
due partly to a lack of participation of the H2 loop in 
antigen binding (Fig. 3). A large number of main 
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chain hydrogen bonds and van der Waals contacts 
leads nevertheless to an dissociation constant with 
antigen of 30 nM. 

The cAb-Lys3 has a very long H3 loop of 24 
residues, of which the first 10 protrude from the 
antigen binding site (Fig. 3). As a consequence, the 
actual antigen binding surface (847 A 2 ) becomes 
even larger than that of most conventional Fvs (613- 
841 A 2 ). The protruding H3 loop is constraint by a 
disulphide bond towards the HI loop and by the 
parallel stacking of two tyrosines in the interior of 
loop H3 (Desmyter et ah, 1996). 

No antigen binding data are available for the 
llama VH. However, its short H3 loop (Spinelli et 
al., 1996) suggests that the actual antigen binding 
surface will be correspondingly small. 



8. Structure of a camelised VH domain 

The structure of one camelised human VH do- 
main was solved in solution by NMR spectroscopy 
(pdb-file 1VHP). This VH contains three camelising 
mutations (G44E, L45R and W47I) in its former VL 
interface. The overall (3-sheet structure is very simi- 
lar to that of a noncamelised, VL-associated human 
VH3 domain (Pot; Fan et al., 1992) with a C" rms 
of 2 A for 82 (out of 113) aligned residues (Riech- 
mann, 1996). The first two hypervariable loops of 
the camelised VH and the Pot-VH have the same 
length and adopt the same canonical structures. The 
H3 loop folds in the Pot-VH towards the VL and 
participates in VH/VL interactions. The (by four 
residues shorter) H3 loop in the camelised VH in 
contrast is orientated more towards the HI and H2 
loops. This is also in conflict with the camelid VH 
domains where the H3 loop, or part of the H3 loop, 
covers the side of the VH, which would form the VL 
interface in conventional antibodies. In the camelid 
VHs, the H3 loop covers hydrophobic regions of the 
VH, which would otherwise be exposed to the sol- 
vent. The camelising mutations reduce the hydropho- 
bic character of the protein surface both in the camel 
and the camelised VHs, as three hydrophobic residues 
(44, 45 and 47) are replaced by more hydrophilic 
amino acids. Hydrophobicity of the former VL inter- 
face in the camelised VH was further reduced by the 



reorientation (compared to a VL-associated VH) of 
the side chains of the nonmutated residues 37, 38 
and 103. The hydrophobic side chains of residues 
V37 and W103 are completely or partly buried in the 
camelised VH, while they point towards to the VL in 
a VL-associated VH. The opposite is the case for the 
hydrophilic R38 side chain. These changes were not 
found in the camel VH structures and indicate a 
principle structural difference between camel and 
camelised VH domains. Whether this will also be 
true for camelised VHs with extremely long H3 
loops as in natural camel VHs remains to be seen. 
Folding back of a long H3 loop onto the former VL 
interface in camelised VH domains might well influ- 
ence its structure. 

No direct structural details are available for anti- 
gen binding by camelised VH domains, but affinity 
improvement due to changes in the HI and H2 loops 
suggest that these may also be involved in antigen 
binding (Davies and Riechmann, 1996b). The H3 
loop will almost certainly play a central role in their 
binding site as it formed the source of structural 
variety in the underlying repertoire, from which the 
VH domains were selected. 



9. Special features of single domain antibodies 

The observation of a convex paratope architecture 
in the camel VH cAb-Lys3 (Desmyter et al., 1996) 
was most remarkable since it has never been ob- 
served in Fvs (Padlan, 1996). Normally, the antigen 
binding loops of an Fv form a cavity, a groove or a 
flat surface (Webster et al., 1994). These Fv topogra- 
phies are correlated with the size and type of antigen. 
Haptens tend to be bound into cavities of the 
paratope, peptides bind into a groove and larger 
antigens such as proteins are bound by antibodies 
with flat paratopes, eventually undulated by some 
side chains to improve the complementarity of the 
surfaces (Padlan, 1996). Large protruding loops of 
10 amino acids or more seem to be unique for the 
antigen binding sites of heavy chain-only antibodies. 
This feature gives camel VHs a special niche for 
antigen recognition, as their long third hypervariable 
loop can insert into cavities of antigen surfaces. This 
is especially important, as the catalytic site of en- 
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zymes is often located at the largest cavity on their 
protein surface (Laskowski et al., 1996). Such sites 
are usually not immunogenic for conventional anti- 
bodies (Novotny et al., 1986). Camel heavy chain- 
only antibodies however are able to recognise such 
epitopes. 

Indeed, in the structure of camel VH cAb-Lys3, 
the 24 residue long H3 loop penetrates deeply into 
the active site of°lysozyme (Transue et al., 1998) 
suggesting that camel heavy chain antibodies might 
be able to form specific enzyme inhibitors. Proof 
was given by the successful retrieval of specific 
inhibitors from the VH library of a camel immunised 
with a-amylase, lysozyme and carbonic anhydrase 
(Lauwereys et al., 1998). Similarly, from a library of 
camelised VHs with randomised third hypervariable 
loops (Davies and Riechmann, 1995a) an inhibitor 
for the hepatitis C virus NS3 protease could be 
selected in vitro (Martin et al., 1997). 

Whether inhibition is indeed in all cases due to 
the insertion of long hypervariable loops into active 
site cavities of the enzymes will have to be con- 
firmed by structural studies, and it cannot be ex- 
cluded that inhibitory VHs might have additional 
strategies to inactivate enzymes. 

The paratope of camel VHs comprises only three 
antigen binding loops, of which H3 provides most of 
the contacts. The lower complexity of their antigen 
binding site might therefore make single domain 
antibodies a suitable FR for the peptide scanning 
technique (Laune et al., 1997) to design smaller 
peptides or peptide analogues with enzyme inhibiting 
or receptor blocking capacity derived from the se- 
quences of the H3 loops. 



10. Conclusions 

The most basic difference between camel VHs 
and camelised VHs is that camel VHs have naturally 
evolved and can be obtained by in vivo immunisa- 
tion. While immunisation can be mimicked in vitro 
through selection of synthetic phage displayed VH 
domains, the architecture of the camel VHs is at 
present still superior to that of camelised VHs be- 
cause of a more mature design as a result of their 
natural evolution. Camelised human VH domains 
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may need additional modifications in their now ex- 
posed, former VL interface to tolerate the exposure 
to a hydrophilic surrounding while maintaining a 
high stability and good folding properties and pro- 
viding a highly diverse antigen binding site. Once 
this disadvantage is overcome, camelised and camel 
VH domain should be equally well suited for the in 
vitro selection of antigen specific single domain 
antibodies. Concerning the in vivo selection of VH 
domains with good binding properties after immuni- 
sation of animals, camel heavy chain-only antibodies 
will obviously remain unchallenged. Advantages of 
camelised, human VHs compared to camel VH do- 
mains however include the presence of a Protein A 
binding in the case of domains based on the human 
VH3 gene family and the possibly more attractive 
use of camelised, human VHs for any therapeutic 
purpose in humans due to a probably lower immuno- 
genicity. 

Given a camel available for immunisation and an 
antigen ready for injection, it is relatively straightfor- 
ward to generate a recombinant camel single domain 
antibody with good affinity and specificity. This may 
prove particularly valuable when specific enzyme 
inhibitors are needed, as these seem to be very rare 
among conventional antibodies but frequent among 
camel heavy chain antibodies. This property appears 
to be closely related to their often very long H3 loop. 
This niche for antigen binding together with the 
advantages in size and stability should lead to an 
important role for single VH domains for biotechno- 
logical applications in the future. 

It seems therefore also desirable to establish in 
vitro systems for the generation of such single do- 
main antibodies avoiding the need for animals. To 
prepare single domain antibodies based on natural 
camel VHs in vitro, a large bank of the VH germline 
segments, which are used in the camel heavy chain- 
only antibodies, must be prepared and for example 
completed with synthetic H3 loops and a suitable 
C-terminal FR. Their display on phage will allow in 
vitro selection of specific recombinant VH domains. 
Camelised, human VH domains present a viable 
alternative to camel VHs. These however may still 
require some improvements to efficiently compete 
with the more reliable biochemical properties of 
natural camel VH domains. Such a design may 
eventually lead to domains even smaller than full- 
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length VH domains, as already shown in the case of 
the VH-based 0-domains (Pessi et al., 1993; Martin 
et al., 1994). Alternatively, it may also be possible to 
utilise light chain variable domains for this purpose, 
although these lack at least in nature the feature of a 
frequently very long hypervariable loop. 

In any case, single VH domains have so far 
proven to perform no worse than Fv or Fab frag- 
ments from ordinary antibodies concerning antigen 
binding and purification yields. In addition, single 
VH domains are definitely more stable due to their 
simpler architecture. They are therefore likely to 
challenge conventional antibodies for many biotech- 
nological applications, while in most therapeutic ap- 
plications conventional antibodies will remain the 
reagent of choice for reasons of immunogenicity. 
Possible exceptions even in therapy are antibody- 
based enzyme inhibitors, as only camel heavy 
chain-only antibodies seem to perform this function 
in vivo, and anti-idiotypic vaccinations, as camel 
VHs should raise good responses in noncamelid 
species due to their noncommon structures of antigen 
binding loops. 
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DR01006 C TCGAG — TCTGGGG6AGG 

DR27006 C TCGAG — TCTGGGGGAGG 

DR03006 C AGGTGA AACTGCTCGAG — TCTGGAGGAGG 

DR11006 C TCGAG— TCTGGGGGAGG 

DR24006 C AGGTGA AACTGCTCGAG— TCTGGGGGAGG 

DR16006 C TCGAG — TCTGGAGGAGG 

DR19006 C TCGAG— TCTGGAGGAGG 

DR07006 C TCGAG— TCTGGGGGAGG 

DR16006 C TCGAG— TCTGGGGGAGG 

DR20006 C TCGAG— TCAGGGGGAGG 

DR25006 C TCGAG— TCTGGGGGAGG 

DR20006 C TCGAG — TCTGGAGGAGG 

DR21006 C TCGAG— TCTGGGGGAGG 

DR09006 C AGGTGA AACTGCTCGAG — TCTGGGGGAGG 

DR17006 C TCGAG— TCTGGGGGAGG 

DR13006 C TCGAG— TCAGGGGGAGG 

DR02006 CTCGAGTCAGGTGTCCGGTCTGATGTGCAGCTGGTGGCGTCTGGGGGAGG 

DR01006 ATCGGTGCAGGCTGGAGGGTCTCTGAGACTCTC--GTGM 

DR27006 CTCGGTGCAGGCTGGAGGGTCTCTGAGACTCTCCTGTGCATCTTCTTCTA 

DR03006 CTCGGTGCAGACTGGAGGATCTCTGAGACTCTCCTGTGCAGT— C-TCTG 

DR11006 GTCGGTGCAGGCTGGAGGGTCTCTGAGACTCTCCTGTAATGT^ 

DR2ii006 GTCGGTGCAGGCTGGAGGGTCTCTGAGACTCTCCTGTAATGT--C-T 

DR16006 CTCGGCGCAGGCTGGAGGATCTCTGAGACTCTCCTGTGCAGC--CCACGG 

DR19006 CTCGGTTCAGGCTGGAGGGTCCCTTAGACTCTCCTGTGCAGC--C-TCTG 

DR07006 CTCGGTGCAGGGTGGAGGGTCTCTGAGACTCTCCTGTGCAA — TCTCTG 

DR16006 CTCGGTGCAGGCTGGAGGGTCTCTGAGACTCTCCTGTACAG— GCTCTG 

DR20006 CTCGGTACAGGTTGGAGGGTCTCTGAGACTCTCCTGTGTAG — CCTCTA 

DR25006 CTCGGTACAAACTGGAGGGTCTCTGAGACTCTCTTGCG— AAATCTCTG 

DR20006 CTCGGTGCAGGCTGGAGGGTCTCTGAGACTCTCCTGT6— TAGCCTCTG 

DR21006 CTCGGTGCAGGTT6GAGGGTCTCTGAAACTCTCCTGTAAAAT---CTCTG 

DR09006 CTCGGTGCAGGCTGGGGGGTCTCTGACACTCTCTTGTG---TATACAC-- 

DR17006 CTCGGTCCAACCTGGAGGATCTCTGACACTCTCCTGTACAGTT--^ 

DR13006 CTCGGTGGAGGCTGGAGGGTCTCTGAGACTCTCCTGTACAG— CCTCTG 

DR02006 CTCGGTGCAGGCTGGAGGCTCTCTGAGACTCTCCTGTACAG— CCTCTG 

DR01006 GA--TACAGTAATT---GTCCCCTCACTTG-GAGCTGGTATCGCCAGTTT 

DR27006 AA--TATATGCCTT — GCACCTACGACAT-GACCTGGTACCGCCAGGCT 

DR03006 GA--TTCTCCTTTA— GTACCAGTTGTAT-GGCCTGGTTCCGCCAGGCT 

DR11006 GC--TCTCCCAGTA — GTACTTATTGCCT-GGGCTGGTTCCGCCAGGCT 

DR24006 GC--TCTCCCAGTA — GTACTTATTGCCT-GGGCTGGTTCCGCCAGGCT 

DR16006 GA--TTCCGC-TCA — ATGGTTACTACAT-CGCCTGGTTCCGTCAGGCT 

DR19006 AC— TACACCATCA— CTGATTATTGCAT-GGCCTGGTTCCGCCAGGCT 

DR07006 GA--TACACGTACG — GTAGCTTCTGTAT-GGGCTGGHCCGCGAGGGT 

DR16006 GA--TTCCCCTATA — GTACCTTCTGTCT-GGGGTGGTTCCGCCAGGCT 

DR20006 CT--CACACCGACA — GTA6CACCTGTAT-AGGCTGGTTCCGCCAGGCT 

DR25006 GA--TTGACTTTTG — ATGATTCTGACGT-GGGGTGGTACCGCCAGGCT 

DR20006 GA--TTCAATTTCG — AAACTTCTCGTAT-GGCGTGGTACCGCCAGACT 

DR21006 GAGGTACCCCAGATCGTGTTCCTAAATCTTTGGCCTGGTTCCGCCAGGCT 

DR09006 CAACGATACTGGGACCA TGGGATGGTTTCGCCAGGCT 

DR17006 --GGGCCACCTACA — GTGACTACAGTATTGGA-TGGATCCGCCAGGCT 

DR13006 G ATACGTAT-CCT CTATGGCCTGGTTCCGCCAGGTT 

DR02006 GAGA CAGTTTCAGTAGATT--TGCCATGTCTTGGTTCCGCCAGGCT 
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DR01006 CCAGGAACGGAGCGC6AGTTC6TCTCCAGTAT6GATCCG6ATG6AAATAC 

DR27006 CCAGGCAAGGAGCGCGAATTTGTCTCAAGTATAAAJATJG^ 

DR03006 TCA6GAAAGCAGCGTGAGGGGGTCGCAGCCATTAATAGT6GCGGTGGTAG 

DR1 1006 CCAGGGAGGGAGCGTGAGGGGGTCACAGCGATTAA CACTGATGG 

DR24006 CCAG6GAAGGAGCGTGAGGGGGTCACAGCGATTAA------CACTGATGG 

)R16006 CCTGGGAAGGGGCGTGAGGGGGTCGCAACAATTAATGGTGGTCG------ 

)R19006 CCAGGGAAGGAGCGTGAATTGGTCGCAGCGATTCAAGTTGTCCGTAGTGA 

)R07006 CCAGGCAAGGAACGTGAGGGGATCGCAACTATTCTTAATGGTGGTACTAA 

)R16006 CCAGGGAAGGAGCGTGAGGGGGTCGCGGGTATTAATAGTGCAGGAGGTAA 

)R20006 CCAGGGAAGGAGCGCGAGGGGGTCGCAAGTATATATTTTGGTGATGGTGG 

)R25006 CCAGGGCATGAGTGCAAATTGGTCTCAGGTATTCTGAGTGATGGTACT-C 

3R20006 CCAGGAAATGTGTGTGAGTTGGTCTCAAGTATTTACAGTGATGG------ 

DR21006 CCAGAGAAGGAGCGCGAGGGGATCGCAGTTCTTTCGACTAAGGATGGTAA 

DR09006 CCAGGGAAAGAGTGCGAAAGGGTCGCGCATATTACGCCTGATGGTATGA- 

DR17006 CCAGGGAAGGACCGTGAAGTAGTCGCAGCCGCTAATACTGGTG 

DR13006 CCAGGGCAGGAGCGCGAGGGGGTCGCGTTTGTTCAAACGG 

DR02006 CCAGGGAAGGAGTGCGAATTGGTCTCAAGCATTCAAAGTAATGGAAGGAC 

DR01006 CAAGTACA CATACTCC6TGAAGGGCCGCTTCACC 

DR27006 AACATACG CAGACTCCGTGAAGGGCCGATTCACC 

DR03006 GACATACTA-CAACACATATGTCGCCGAGTCCGTGAAGGGCCGATTCGCC 

DR11006 CAGTATCAT-ATACGCA GCCGACTCCGTGAAGGGCCGATTCACC 

DR2W06 CAGT6TCAT-ATACGCA GCCGACTCCGTGAAGGGCCGATTCACC 

DR16006 CGA-CGTCACATACTACGCCGACTCCGTGACGGGCCGATTTACC 

DR19006 TACT--CGC-C-TCACAGACTACGCCGACTCCGTGAAGGGACGATTCACC 

DR07006 CACATACTATGCCGACTCGGTGAAGGGCCGATTCACC 

DR16006 TACTTACTATGCCGACGCCGTGAAGGGCCGATTCACC 

DR20006 TACGAATTATCGCGACTCCGTGAAGGGCCGATTCACC 

DR25006 CATATACAAAGAGTGGAGACTATGCTGAGTCTGTGAGGGGCCGGGTTACC 

DR20006 CA-AAACATACTACGTCGACC--GCA TGAAGGGCCGATTCACC 

DR21006 GA CATTCTATGCCGACTCCGTGAAGGGCCGATTCACC 

DR09006 CCTTCATTGATGAACCCGTGAAGGGGCGATTCACG 

DR17006 CGACTAGTAAATTCTACGTCGACTTTGTGAAGGGCCGATTCACC 

DR13006 --CTGACAAT-AGTGCATTATATGGCGACTCCGTGAAGGGCCGATTCACC 

DR02006 AACTGA GGCCGATTCCGTGCAAGGCCGATTCACC 

DR01006 ATGTCCCGAGGCAGCACCGAGTACACAGTATTTCTGCAAATGGACAATCT 

DR27006 ATCTCCCAAGACAGCGCCAAGAACACGGTGTATCTGCAGATGAACAGCCT 

DR03006 ATCTCCCAAGACAACGCCAAGACCACGGTATATCTTGATATGAACAACCT 

DR11006 ATCTCCCAAGACACCGCCAAGGAAACGGTACATCTCCAGATGAACAACCT 

DR24006 ATCTCCCAAGACACCGCCAAGAAAACGGTATATCTCCAGATGAACAACCT 

DR16006 ATCTCCCGAGACAGCCCCAAGAflTACGGTGTATCTGCAGATGAACAGCCT 

DR19006 ATCTCCCAAGGCAACACCAAGAACACAGTGAATCTGCAAATGAACAGCCT 

DR07006 ATCTCCCAAGACAGCACGTTGAAGACGATGTATCTGCTAATGAACAACCT 

DR16006 ATCTCCCAAGGGAATGCCAAGAATACGGTGTTTCTGCAAATGGATAACTT 

DR20006 ATCTCCCAACTCAACGCCCAGAACACAGTGTATCTGCAAATGAACAGCCT 

DR25006 ATCTCCAGAGACAACGCCAAGAACATGATATACCTTCAAATGAACGACCT 

DR20006 ATTTCTAGAGAGAATGCCAAGAATACATTGTATCTACAACTGAGCGGCCT 

DR21006 ATCTTCTTAGATAATGACAAGACCACTTTCTCCTTACAACTTGATCGACT 

DR09006 ATCTCCCGAGACAACGCCCAGAAAACGTTGTCTTTGCGAATGAATAGTCT 

DR17006 ATTTCCCAAGACAACGCCAAGAATACGGTATATCTGCAAATGAGCTTCCT 

DR13006 ATCTCCCACGACAACGCCAAGAACACGCTGTATCTGCAAATGCGCAACCT 

DR02006 ATCTCCCGAGACAATTCCAGGAACACAGT6TATCT6CAAATGAACAGCCT 
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DR01006 GAAACCTGAGGACACGGCGATGTATTACTGTAAAAC-A — GCCCTAC-- 

DR27006 GAAACCTGAGGACACGGCGATGTATTACTGTAAAAT-A — GA--TTC-- 

DR05006 AACCCCTGAAGACACGGCTACGTATTACTGTGCGGCGG — TCCCAGCCC 

DR11006 GCAACCTGAGGATACGGCCACCTATTACTGCGCGGCAA — GACTGACGG 

DR24006 GCAACCTGAGGATACGGCCACCTATTACTGCGCGGCAA — GACTGACGG 

DR16006 GAAACCTGAGGACACGGCCATCTACTTCTGTGCAGCAG — G CTC 

DR19006 GACACCTGAGGACACGGCCATCTACAGTTGTGCGGCAA— C CAG 

DR07006 GAAACCTGAAGACACGGGCACCTATTACTGTGCTG-CA — GAACTAAGT 

DR16006 GAAACCTGAGGACACGGCCATCTATTACTGCGCGG-CG — GATAGTCCA 

DR20006 GAAACCTGAGGACAGCGCCAT6TACTACTGTGCAATCA — CTGAAATTG 

DR25006 GAAACCTGAGGACACGGCCATGTATTACTGCGCGGTAGATGGTTGGACCC 

DR20006 CAAACCTGAGGACACGGCCATGTATTACTGTGCG CC 

DR21005 GAACCCGGAGGACACTGCCGACTACTACTGCGCTGCAAATCAATTAGC- 

DR09006 GAGGCCTGAGGACACGGCCGTGTATTACTGTGCGGCAGATTG 

DR17006 GAAACCTGAGGACACGGCCATCTATTACTGTGCGGCAG CGGACCC 

DR13006 GCAACCTGACGACACTGGCGTGTACTACTGTGCGGCC CAA 

DR02006 GAAACCCGAGGACACGGCCGTGTATTACTGTGGGGCAGT 

DR01006 A-AC-- CTGGGGGTTATTGTGGGTA- 

DR27006 GTAC— CCGTGCCATCTCCTTGATG- 

DR03006 ACTTGGGACCT GGCG-CCATT CTTGATTTG 

DR11006 AGATGGGGGCTTGTGATGCGAGATGG6CGACCTTAGC--GACAAGGAC-G 

DR24006 AGATGGGGGCTTGTGATGCGAGATGGGCGACCTTAGC--GACAAGGAC-G 

DR16006 GCGTTTTT-CTAGTCCTGTTGGGAGCACTTC-TAGAC — TCGAAAGTAG 

DR19006 TAGTTTTTACTGGTACT GCAC C — ACG G 

DR07006 GGTGGTAGTTGTGAATTGC — CTTTGC TATTTGACTA 

DR15006 TGTTACATGCCGACTATGC — CCGCTCCCCCGATACGAGACAGTTTTGG 

DR20006 AGTGGTATGGGTGCAATTT — AAGGACTACTTTTACT — C G 

DR25006 GGAAGGAAG--GGGGAATCGGGTTAC CCTGGTCGGTCCAATGTGAA 

DR20006 GGTTGAA TATC CTATTGCAGAC--ATGTGTT 

DR21006 — TGGTGGCTGGTATT TGGACCCGAATTACTGG-CTCTCTGTG 

DR09006 — GAAATACTGGA CTTGTGGTGC--CCAGA-CTGG AG 

DR17006 AAGTATATATTATAGTATC CTCCNNAT 

DR13006 AAGAAGGATCGTA CTAGATGGGC CGAGCCT 

DR02006 CTCCCTAA--TGGACCGAATTTC 

DR01006 —TGGGTANTGCCTCTGGGGCCAGGGGACCCAGGTCACCGTCTCCTCACT 

DR27006 --T CTGGGGCCAGGGGACCCAGGTCACCGTCTCCTCACT 

DR03006 AAAAAGTATAAGTACTGGGGCCAGGGGACCCAGGTCACCGTCTCCTCACT 

DR11006 TTTGCGTATAACTACTGGGGCCGGGGGACCCAGGTCACCGTCTCCTCACT 

DR24006 TTTGCGTATAACTACTGGGGCCGGGGGACCCAGGTCACCGTCTCCTCACT 

DR16006 CGA-CT-ATAACTATTGGGGCCAGGGGATCCAGGTCACCGTCACCTCACT 

DR19006 CGC-CTTATAACGTCTGGGGTCAGGGGACCCAGGTCACCGTCTCCTCACT 

DR07006 CTGGG GCCAGGGCACCCAGGTCACCGTCTCCTCACT 

DR16006 CTGGGATGATTTT GGCCAG6GGACCCAGGTCACCGTCTCCTCACT 

DR20006 CTGGG GCCAGGGGACCCAGGTCACCGTCTCCTCACT 

DR25006 GATGGTTATAACTATTGGGGCCAGGCGACCCAGGTCACCGTCTCCTCAC- 

DR20006 CGAGAT ACG — GCGACCCGGGGACCCAGGTCACCGTCTCCTCAC- 

DR21006 GGTGCATATGCCATCTGGGGCCAGGGGACCCAGGTCACCGTCTCCTCAC- 

DR09006 GATACTTCGGACAG-TGGGGTCAGGGGGCCCAGGTCACCGTCTCCTCACT 

DR17006 — TGAGTATAAGTACTGGGGCCA6GGGACCCAGGTCACCGTCTCCTCA- 

DR13006 CGAGAATGGAACAACTGGGGCCAGGGGACCCAGGTCACCGTCTCCTCA- 

DR02006 CCAACATGGG— TGCCGGGGCCAGGGAACCCAGGTCACCGTCTCCT 
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DR01006 AG TTACCCGTACGACGTTCCG6ACTACGGTTCTTAATAGAATTC 

DR27006 AG TTACCCGTACGAGCTTCCGGACTACGGTTCTTAATAGAATTC 

DR03006 AGCTAGTTACCCGTACGACGTTCCGGACTACGGTTCTTAATAGAATTC 

DR11006 AG TTACCCGTACGACGTTCCGGACTACGGTTCTTAATAGAATTC 

DR24006 AGCTAGTTACCCGTACGACGTTCCGGACTACGGTTCTTAATAGAATTC 

DR16006 AGTTACCCGTACGACGTTCCGGACTACGGTTCTTAATAGAATTC 

DR19006 AGTTACCCGTACGACGTTCCGGACTACGGTTCTTAATAGAATTC 

DR07006 - — AGTTACCCGTACGACGTTCCGGACTACGGTTCTTAATAGAATTC 

DR16006 AGTTACCCGTACGACGTTCCGGACTACGGTTCTTAATAGAATTC 

DR20006 AGTTACCCGTACGACGTTCCGGACTACGGTTCTTAATAGAATTC 

DR25006 — TAGTTACCCGTACGACGTTCCGGACTACGGTTCTTAATAGAATTC 

DR20006 — TAGTTACCCGTACGACGAACCGGACTACGGTTCTTAATAGAATTC 

DR21006 — TAGTTACCCGTACGACGTTCCGGACTACGGTTCTTAATAGAATTC 

DR09006 AGCTAGTTACCCGTACGACGTTCCGGACTACGGTTCTTAATAGAATTC 

DR17006 

DR13006 

DR02006 TA 

FIG. 7D 
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IMMUNOGLOBULINS DEVOID OF LIGHT Ward et al have described in their publication that V„ 

CHAINS domains are relatively sticky, presumably due to the exposed 
hydrophobic surface normally capped by the V K or V\ 

This is a division of application Ser. No. 08/106.944, domains. They consequently envisage that it should be 

filedAug. 17 1993 now abandoned. 5 possible t0 ^ v domains havi ^ oy ^ proper ties 

The invention relates to new isolated unmunoglobuhns and fimher ^ v domains wi(h Mndi actMties couM 

wmch are devoid of light polypeptide chains. These unmu- serve as ^ buildi Wocks f ^ variaWe fr 

noglobulms do not consist m the degradation products of , P „ fr.,™,,,,^ ™- ™ m .,i»t,> „,t;»™«»o 

immunoglobulins composed of both h«vy polypeptide and ( FV £?™*3 Z , ^ t T« t ,h ■„ 

light polypeptide chains but to the contra^, ^invention , n ... The /l™ 0 " f, 06 * n « ' tart fr ° m * e ' de u a toe 

defines a new member of the family of the 10 Cerent fragments (hght and heavy chams) and the different 

immunoglobulins, especially a new type of molecules dornains °f these fragments of four-chain model immuno- 

capable of being involved in the immune recognition. Such globulin can be modified to define new or improved antigen 

immunoglobulins can be used for several puiposes. espe- binding sites or a four-chain model immunoglobulin, 

daily for diagnosis or therapeutical purposes including The inventors have determined that inmunoglobulins can 

protection against pathological agents or regulation of the 13 have a different structure than the known four-chain model 

expression or activity of proteins. and that such different immunoglobulins offer new means 

Up to now the structure proposed for immunoglobulins for the preparation of diagnosis reagents, therapeutical 

consists of a four-chain model referring to the presence of agents or any other reagent for use in research or industrial 

two identical light polypeptide chains (light chains) and two purposes. 

identical heavy polypeptide chains (heavy chains) linked 20 Thus the invention provides new immunoglobulins 

together by disulfide bonds to form a y- or T-shaped mac- which are capable of showing functional properties of 

romolecules. These chains are composed of a constant four-chain model immunoglobulins although their structure 

region and a variable region, the constant region being appears to be more appropriate in many circumstances for 

subdivided in several domains. The two heavy polypeptide their use. their preparation and in some cases for their 

chains are usually linked by disulphide bounds in a so-called 25 modification. Moreover these molecules can be considered 

"hinge region" situated between the first and second as lead structures for the modification of other immunoglo- 

domains of the constant region. bulins. The advantages which are provided by these immu- 

Among the proteins forming the class of the noglobulins comprise the possibility toprepare them with an 

immunoglobulins, most of them are antibodies and accord- increased facility. 

ingly present an antigen binding site or several antigen 30 The invention accordingly relates to immunoglobulins 

binding sites. characterized in that they comprise two heavy polypeptide 

According to the four-chain model, the antigen binding chains sufficient for me formation of a complete antigen 

site of an antibody is located in the variable domains of each binding site or several antigen binding sites, these immu 

of the heavy and light chains, and requires the association of noglobulins being further devoid of light polypeptide chains, 

the heavy and the light chains variable domains. 35 In a particular embodiment of the invention, these immu- 

For the definition of these four-chain model noglobulins are further characterized by the fact that they are 

immunoglobulins, reference is made to Roitt. I et al the product of the expression in a prokaryotic or in a 

(Immunology-se«ond-Edition Gower Medical Publishing eukaryotic host cell, of a DNA or of a cDNA having the 

USA, 1989). Reference is especially made to the part sequence of an immunoglobulin devoid of fight chains as 

concerning the definition of the four-chain 40 obtainable from lymphocytes or other cells of Camelids. 

immunoglobulins, their polypeptidic and genetic structures. The immunoglobulins of the invention can be obtained 

the definition of their variable and constant regions and the for example from the sequences which are described in FIG. 

obtention of the fragments produced by enzymatic degra- 7. 

dation according to well known techniques. The immunoglobulins of the invention, which are devoid 

The inventors have surprisingly established that different 45 of light chains are such that the variable domains of their 

molecules can be isolated from animals which naturally heavy chains have properties differing from those of the 

produce them, which molecules have functional properties four-chain immunoglobulin V„. The variable domain of a 

of immunoglobulins these functions being in some cases heavy-chain immunoglobulin of the invention has no normal 

related to structural elements which are distinct from those interaction sites with the or with the C„l domain which 

involved in the function of four-chain immunoglobulins due 50 do not exist in the heavy chain immunoglobulins it is hence 

for instance to the absence of light chains. a novel fragment in many of its properties such as solubility 

The invention relates to two-chain model immunoglo- and position of the binding site. For clarity reasons we will 

bulins which neither correspond to fragments obtained for call it V HH in this text to distinguish it from the classical V„ 

instance by the degradation in particular the enzymatic of four-chain immunoglobulins. 

degradation of a natural four-chain model immunoglobulin. 55 By "a complete antigen binding site" it is meant accord- 

nor correspond to the expression in host cells, of DNA ing to the invention, a site which will alone allow the 

coding for the constant or the variable region of a natural recognition and complete binding of an antigen. This could 

four-chain model immunoglobulin or a part of these regions, be verified by any known method regarding the testing of the 

nor correspond to antibodies produced in lymphopaties for binding affinity. 

example in mice, rats or human. 60 These immunoglobulins which can be prepared by the 

E. S. Ward et al (1) have described some experiments technique of recombinant DNA, or isolated from animals, 

performed on variable domains of heavy polypeptide chains will be sometimes called "heavy-chain immunoglobulins" in 

(V H ) or/and light polypeptide chains (Vg/Fy) to test the the following pages. In a preferred embodiment of the 

ability of these variable domains, to bind specific antigens. invention, these immunoglobulins are in a pure form 

For this purpose, a library of V„ genes was prepared from 65 In a first embodiment, the immunoglobulins of the inven- 

the spleen genomic DNA of mice previously immunized tion are obtainable in prokaryotic cells, especially in E. coli 

with these specific antigens. cells by a process comprising the steps of: 
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a) cloning in a Bluecript vector of a DNA or cDNA sequence These catalytic antibodies can be modified at the level of 
coding for the V ffl domain of an immunoglobulin devoid their biding site, by random or directed mutagenesis in order 
of light chain obtainable for instance from lymphocytes of to increase or modify their catalytic function. Reference may 
Camelids. be made to the publication of Lerner et al (TIBS November 

b) recovering the cloned fragment after amplification using 5 1987. 427^30) for the general technique for the preparation 
a 5' primer containing an Xho site and a 3' primer of such catalytic immunoglobulins. 

containing the Spe site having the following sequence According to a preferred embodiment, the imrminoglo- 

TC TTA ACT ACT GAG GAG ACG GTG ACC TG. SEQ bulins of the invention are characterized in that their variable 

ID NO: 51 regions contain in position 45. an amino-acid which is 

c) cloning the recovered fragment in phase in the immuno 10 different from leucine, proline or glutamine residue. 

PBS vector after digestion of the vector with Xho and Spe Moreover the heavy-chain immunoglobulins are not prod- 
restriction enzymes, ucts characteristic of lymphocytes of animals nor from 

d) transforming host cells, especially E. coli by transfection lymphocytes of a human patient suffering from lymphop- 
with the recombinant immuno PBS vector of step c. amies. Such immunoglobulins produced in lymphopathies 

e) recovering the expression product of the \ HH coding 15 are monoclonal in origin and result from pathogenic muta- 
sequence. for instance by using antibodies raised against tions at the genomic level. They have apparently no antigen 
the dromadary V HW domain. binding site. 

In another embodiment the immunoglobulins are hetero- The two heavy polypeptide chains of these immunoglo- 

specific immunoglobulins obtainable by a process compris- bulins can be linked by a hinge region according to the 

ing the steps of: 20 definition of Roitt et al. 

obtaining a first DNA or cDNA sequence coding for a \ HH In a particular embodiment of the invention, immunoglo- 

domain or part thereof having a determined specificity bulins corresponding to the above-defined molecules are 

against a given antigen and comprised between Xho and capable of acting as antibodies. 

Spe sites. The antigen binding site(s) of the immunoglobulins of the 

obtaining a second DNA or cDNA sequence coding for a 25 invention are located in the variable region of the heavy 

y HH domain or part thereof, having a determined sped- chain. 

ficity different from the specificity of the first DNA or In a particular group of these immunoglobulins each 

cDNA sequence and comprised between the Spe and heavy polypeptide chain contains one antigen binding site 

EcoRI sites. on its variable region, and these sites correspond to the same 

digesting an immuno PBS vector with EcoRI and Xhol 30 amino-acid sequence. 

restriction enzymes. In a further embodiment of the invention the immuno- 

ligating the obtained DNA or cDNA sequences coding for globulins are characterized in that their heavy polypeptide 

\ HH domains, so that the DNA or CDNA sequences are chains contain a variable region (V„„) and a constant region 

serially cloned in the vector. (C„) according to the definition of Roitt et al. but are devoid 

transforming a host cell, especially E. coli cell by 35 of the first domain of their constant region. This first domain 

transfection, and recovering the obtained immunoglobu- of the constant region is called C„l. 

lins. These immunoglobulins having no C„l domain are such 

In another embodiment, the immunoglobulins are obtain- that the variable region of their chains is directly linked to 

able by a process comprising the steps of: the hinge legion at the C-terminal part of the variable region. 

obtaining a DNA or CDNA sequence coding for a V H „ 40 The immunoglobulins of the type described here-above 

domain or part thereof, having a determined specific can comprise type G immunoglobulins and especially 

antigen binding site. immunoglobulins which are defined as irmnunogloblilins of 

amplifying the obtained DNA or cDNA. using a 5' primer class 2 (IgG2) or immunoglobulins of class 3 (IgG3). 

containing an initiation codon and a HindlQ site, and a 3' The absence of the light chain and of the first constant 

primer containing a termination codon having a Xhol site. 45 domain lead to a modification of the nomenclature of the 

recombining the amplified DNA or cDNA into the Hindm immunoglobulin fragments obtained by enzymatic 

(position 2650) and Xhol (position 4067) sites of a digestion, according to Roitt et al. 

plasmid pMM984. The terms Fc and pFc on the one hand. Fc' and pFc' on the 

transfecting permissive cells especially NB-E cells with the other hand corresponding respectively to the papain and 

recombinant plasmid. 50 pepsin digestion fragments are maintained, 

recovering the obtained products. The terms Fab F(ab) 2 F(ab') 2 Fabc, Fd and Fv are no 

Successful expression can be verified with antibodies longer applicable in their original sense as these fragments 

directed against a region of a Y HH domain, especially by an have either a light chain, the variable part of the light chain 

ELISA assay. or the C H 1 domain. 

According to another particular embodiment of this 55 The fragments obtained by papain digestion and corn- 
process, the immunoglobulins are cloned in a parvovirus. posed of the \ HH domain and the hinge region will be called 

In another example these immunoglobuUns are obtainable FV^ or F(V ww h) 2 depending upon whether or not they 

by a process comprising the further cloning of a second remain linked by the disulphide bonds. 

DNA or cDNA sequence having another determined antigen In another embodiment of the invention, immunoglobu- 

binding site, in the pMM984 plasmid. 60 lins replying to the hereabove given definitions can be 

Such an Immunoglobulin can be further characterized in originating from animals especially from animals of the 

that it is obtainable by a process wherein the vector is Yep camelid family. The inventors have found out that the 

52 and the transformed recombinant cell is a yeast especially heavy-chain inimunoglobulins which are present in cam- 

S. cerevisiae. elids are not associated with a pathological situation which 

A particular Immunoglobulin is characterized in that it has 65 would induce the production of abnormal antibodies with 
a catalytic activity, especially in that it is directed against an respect to the four-chain immunoglobulins. On the basis of 

antigen mimicking an activiated state of a given substrate. a comparative study of old world camelids (Camelus bac- 
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trianus and Camelus dromaderius) and new world camelids The immunoglobulins of the invention which are devoid 

(for example Lama Paccos, Lama Glama. and Lama Q f light chains, nevertheless comprise on their heavy chains 

Vicuqna) the inventors have shown that the immunoglobu- a constant region and a variable region . ^ constaat reg i on 

lins of the invention which are devoid of light polypeptide com p ri ses different domains, 

chains are found in all species. Nevertheless differences may 5 

be apparent in molecular weight of these immunoglobulins ^ vanable re 8 lon of immunoglobulins of the invention 
depending on the animals. Especially the molecular weight comprises frameworks (FW) and complementarity deter- 
of a heavy chain contained in these immunoglobulins can be mining regions (CDR). especially 4 frameworks and 3 
from approximately 43 kd to approximately 47 kd, in complementarity regions. It is distinguished from the four- 
particular 45 kd. io chain immunoglobulins especially by the fact that this 

Advantageously the heavy-chain immunoglobulins of the variable region can itself contain an antigen binding site or 

invention are secreted in blood of camelids. seV eral, without contribution of the variable region of a light 

Immunoglobulins according to this particular embodi- cna i n wmc jj j s a t> se nt 
ment of the invention are obtainable by purification from „ . . . ' . , , « . 
serum of camelids and a process for the purification is u ^ "^muI sequences of frameworks 1 and 4 corn- 
described in details in the examples. In the case where the ? nse amon 8 others respectively amino-acid sequences 
immunoglobulins are obtained from Camelids, the invention wmcn can ^ selected from the following for the framework 
relates to immunoglobulins which are not in their natural 1 domain 
biological environment. 

According to the invention immunoglobulin IgG2 as 20 ogsvqtgoslrlscei s gl t f d seqidnoh 

obtainable by purification from the serum of camelids can be ggsvqtggslrlscavsgfsfsseqidnO:2 
characterized in that: 

it is not adsorbed by chromatography on Protein G ggseqgggslrlscaisgytygseqid ncw 

Sepharose column, ggsvqpggsltlsctvsgatys seqidno* 

it is adsorbed by chromatography on Protein A Sepharose 25 

column, " GGSVQAGGSLRLSCTGSGFPYSSEQID NO:5 

it has a molecular weight of around 100 kd after elution with GGSVQAGGSLRLSCVAGFGTSSEQID NO 6 
a pH 4.5 buffer (0. 15M NaCl. 0.58% acetic acid adjusted 

to pH 4.5 by NaOH), ggsvqaggslrlscvsfspssseqid no:7 

it consists of heavy y2 polypeptide chains of a molecular 30 

weight of around 46 kd preferably 45 after reduction. 

According to a further embodiment of the invention 
another group of immunoglobulins corresponding to IgG3, for the framework 4 domain 
as obtainable by purification from the serum of Camelids is 

characterized in that the immunoglobulin: 35 wgqgtqvtvss seq id NO:8 

is adsorbed by chromatography on a Protein A Sepharose wgqgtlvtvss seq ID no 9 

column, 

has a molecular weight of around 100 kd after elution with wgqgaqvtvss seqidno:io 

a pH 3.5 buffer (0.15M NaCl, 0.58% acetic acid). wgqgtqvtass seqidno:1i 

is adsorbed by chromatography on a Protein G Sepharose 40 

column and eluted with pH 3.5 buffer (0.15M NaCl. RGQGTQvtvsl seqidno:12 

0.58% acetic acid), 
consists of heavy y3 polypeptide chains of a molecular 

weight of around 45 Kd in particular between 43 and 47 

kd after reduction. for the CDR3 domain 

ALQPGGYCGYGX CL SEQDDNO:62 

VSLMDRI S Q H G C SEQ ID NO:63 

VPAHLGPGAI LDLKKY KY SEQIDNO:64 

FCYSTAGDGGSGE --MY SEQ ID NO:65 

ELSGGSCELPLLF D Y SEQIDNO:66 

DWKYWTCGAQTGGYF G Q SEQ ID NO:67 

RLTEMGACDARWATLATRTFAYNY SEQ ID N0:68 

QKKDRTRWAEPREW- N N SEQIDN0:69 

GSRFSSPVGSTSRLES-SDY--NY SEQ ID NO:70 

ADPSI YYSI LXIEY KY SEQ ID NO:71 

DSPCYMPTMPAPPI RDSFGW--DD SEQ ID NO:72 

TSSFYWYCTTAPY N V SEQ ID NO;73 

TBI EWYGCNLRTTF TR SEQ ID NO: 74 
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^ I SEQIDNO:75 

RL TE MGACDAR WATL AT RTF A Y N Y SEQIDNO:76 
DGWTRKEGGI GLPWSVQCEDGYNY SEQIDNO:77 
DSYPCHLL--------------DV SEQIDNO:78 

VEYPI ADMCS RY SEQ1DN0:79 



As stated above, the immunoglobulins of the invention 
are preferably devoid of the totality of their C H l domain. 

Such immunoglobulins comprise C H 2 and C H 3 domains 
in the C-terminal region with respect to the hinge region. 

According to a particular embodiment of the invention the 
constant region of the immunoglobulins comprises C H 2 and 
C H 3 domains comprising an amino-acid sequence selected 
from the following for the C„2 domain: 
APELLGGFTVFTFPPKPKDVLSrrLTP SEQ ID NO: 31 
APELPGGPSVFVFPTKPKDVLSISGRP SEQ ID NO: 32 
APELPGGPSVFVFPPKPKDVLSISGRP SEQ ID NO: 33 
APELLGGPSVFTFPPKPKDV1 SISGRP SEQ ID NO: 34 
for me C H 3 domain: 
GQTREPQVYTLA SEQ ID NO: 35 
GQTREPQVYTLAPXRLEL SEQ ID NO: 36 
GQPREPQVYTLPPSRDEL SEQ ID NO: 109 
GQPREPQVYTLPPSREEM SEQ ID NO: 110 
GQPREPQVYTLPPSQEEM SEQ ID NO: 111 

Interestingly the inventors have shown that the binge 
region of the immunoglobulins of the invention can present 
variable lengths. When these immunoglobulins act as 
antibodies, the length of the hinge region will participate to 
the determination of the distance separating the antigen 
binding sites. 

Preferably an immunoglobulin according to the invention 
is characterized in that its hinge region comprises from 0 to 
50 amino-acids. 

Particular sequences of hinge region of the immunoglo- 
bulins of the invention are the following. 
GTNEVCKCPKCP SEQ ID NO: 37 

EPKTPQPQPKPQPQPQPQPKPQPKPEPECTCPKCP SEQ 
ID NO: 38 

The short hinge region corresponds to an IgG3 molecule 
and the long hinge sequence corresponds to an IgG2 mol- 
ecule. 

Isolated V HH derived from heavy chain immunoglobulins 
or V HH libraries corresponding to the heavy chain immu- 
noglobulins can be distinguished from \ HH cloning of 
four-chain model immunoglobulins on the basis of sequence 
features characterizing heavy chain immunoglobulins. 

The camel heavy — chain immunoglobulin V HH region 
shows a number of differences with the V„„ regions derived 
from 4-chain immunoglobulins from all species examined. 
At the levels of the residues involved in the V H/ /V t 
interactions, an important difference is noted at the level of 
position 45 (FW) which is practically always leucine in the 
4-chain immunoglobulins (98%), the other amino acids at 
this position being proline (1%) or glutamine (1%). 

In the camel heavy-chain immunoglobulin, in the 
sequences examined at present, leucine at position 45 is only 
found once. It could originate from a four-chain immuno- 
globulin. In the other cases, it is replaced by arginine. 
cysteine or glutamic acid residue. The presence of charged 
amino acids at this position should contribute to making the 
V HH more soluble. 

The replacement by camelid specific residues such as 
those of position 45 appears to be interesting for the con- 



struction of engineered V HH regions derived from (he \ HH 
repertoire of 4-chain immunoglobulins. 
A second feature specific of the camelid V HH domain is 

13 the frequent presence of a cysteine in the CDR 3 region 
associated with a cysteine in the CDR, position 31 or 33 or 
FW 2 region at position 45. The possibility of establishing a 
disulphide bond between the CDR 3 region and the rest of the 
variable domain would contribute to the stability and posi- 
tioning of the binding site. 

20 With the exception of a single pathogenic myeloma 
protein (DAW) such a disulphide bond has never been 
encountered in immunoglobulin V regions derived from 4 
chain immunoglobulins. 
The heavy-chain immunoglobulins of the invention have 

25 further the particular advantage of being not sticky. Accord- 
ingly these irnmunoglobulins being present in the serum, 
aggregate much less than isolated heavy chains of a four- 
chain immunoglobulins. The immunoglobulins of the inven- 
tion are soluble to a concentration above 0.5 mg/ml. pref- 

30 erably above 1 mg/ml and more advantageously above 2 
mg/ml. 

These immunoglobulins further bear an extensive antigen 
binding repertoire and undergo affinity and specificity matu- 
ration in vivo. Accordingly they allow the isolation and the 

35 preparation of antibodies having defined specificity, regard- 
ing determined antigens. 

Another interesting property of the immunoglobulins of 
the invention is that they can be modified and especially 
humanized. Especially it is possible to replace all or part of 
the constant region of these immunoglobulins by all or part 

40 of a constant region of a human antibody. For example the 
C w 2 and/or C«3 domains of the immunoglobulin could be 
replaced by the C H 2 and/or C„3 domains of the IgG y3 
human immunoglobulin. 
In such humanized antibodies it is also possible to replace 

45 a part of the variable sequence, namely one or more of the 
framework residues which do not intervene in the binding 
site by human framework residues, or by a part of a human 
antibody. 

Conversely features (especially peptide fragments) of 
50 heavy-chain immunoglobulin V HH regions, could be intro- 
duced into the V H or V L regions derived from four-chain 
immunoglobulins with for instance the aim of achieving 
greater solubility of the immunoglobulins. 
The invention further relates to a fragment of an immu- 
55 noglobulin which has been described hereabove and espe- 
cially to a fragment selected from the following group: 
a fragment corresponding to one heavy polypeptide chain of 

an immunoglobulin devoid of light chains, 
fragments obtained by enzymatic digestion of the immuno- 
60 globulins of the invention, especially those obtained by 
partial digestion with papain leading to the Fc fragment 
(constant fragment) and leading to FV„„h fragment 
(containing the antigen binding sites of the heavy chains) 
or its dimer FfV HH h) 2 , or a fragment obtained by further 
65 digestion with papain of the Fc fragment, leading to the 
pFc fragment corresponding to the C-terminal part of the 
Fc fragment. 
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homologous fragments obtained with other proteolytic The amino-acids of the Pro-X sequence are chosen among 

enzymes, aoy natural or non natural amino-acids. 

a fragment of at least 10 preferably 20 amino acids of the fragments can be obtained by enzymatic degradation 

variable region of the immunoglobulin, or the complete of ^ i^n^^. They can ^ be obtained b 

variable region, especially a fragment corresponding to 5 . . , 

the isolated V„„ domains or to the V„„ dimers linked to !? ^ " or 2 al,Bms - °[ nucleotlde se <J ue »<* 

the hinge disulphide. codin S for Ae immunoglobulins, or they can be chemically 

a fragment corresponding to the hinge region of the synthesized, 

immunoglobulin, or to at least 6 amino acids of this hinge ^ invention also relates to anti-idiotypes antibodies 

region, io belonging to the heavy chain immunoglobulin classes. Such 

a fragment of the hinge region comprising a repeated anti-idiotypes can be produced against human or animal 

sequence of Pro-X, idiotypes. A property of these anti-idiotypes is that they can 

a fragment corresponding to at least 10 preferably 20 amino be used as idiotypic vaccines, in particular for vaccination 

acids of the constant region or to the complete constant against g i ycC) p rotems or glycolipids and where the carbo- 

regionofmeirnmunoglobulin. ^ hydrate detenrnnes the epitope. 

The invention also relates to a fragment comprising a ~ . , K , , ^ . , ^ 

related sequence. Pro-X wmchre P eatr s equenc7contains The invenhon also relates to antiidiotypes capable of 

at least 3 repeats of Pro-X. X being any amino-acid and ™™m™g idiotypes of heavy-chain immunoglobulins, 

preferably Gin (glutamine), Lys (lysine) or Glu (acide Such antiidiotype antibodies can be either syngeneic 

glutamique); a particular repeated fragment is composed of 20 antibodies or allogenic or xenogeneic antibodies, 

a 12-fold repeat of the sequence Pro-X. The invention also concerns nucleotide sequences coding 

Such a fragment can be advantageously used as a link for all or part of a protein which amino-acid sequence 

between different types of molecules. comprises a peptide sequence selected from the following: 

GGSVQTGGSLRLSCEI SGLTFD SEQIDNO:l 
GGSVQTGGSLRLSCAVSGFSFS SEQ ID NO:2 
GGSEQGGGSLRLSCAI SGYTYG SEQ ID NO: 3 
GGSVQPGGSLTLSCTVS GATYS SEQ ID NO:4 
GGSVQAGGSLRLSCTGSGFPYS SEQIDNGS 
GGS VQAGGS LRLS CVAGFGTS SEQ ID NO:6 
GGSVQAGGSLRLSCVSFSPSS SEQIDNO:7 
WGQGTQVTVSS SEQE>NO:8 
WGQGTLVTVSSSEQID NO:9 
WGQGAQVTVS S SEQIDNOilO 
WGQGTQVTAS S SEQIDNOll 
RGQGTQVTVSL SEQtt>NO:12 

ALQPGGYCGYGX CL SEQIDNO:62 

VSLMDRI SQH------------GC SEQ ID NO:63 

VPAHLGPGAI LDLKKY- --KY SEQIDNO:64 

FCYSTAGDGGSGE --MY SEQ ID NO:6S 

ELSGGSCELPLLF---------DY SEQ ID NO:66 

DWKYWTCGAQTGGYF G Q SEQ ID NO:67 

RLTEMGACDARWATLATRTFAYNY SEQ ID NO:68 

QKKDRTRWAEPREW NN SEQIDNO:69 

GSRFSSPVGSTSRLES-SDY NY SEQIDNO:70 

ADPSI YYSILXIEY KY SEQIDNO:71 

DSPCYMPTMPAPPI RDSFGW D D SEQ ID NO:72 

TSSFYWYCTTAPY N V SEQ ID NO:73 

TEI EWYGCNLRTTF T R SEQ ID NO:74 

NQL AGGWYL DP NYWL S VGAY AI SEQ ID NO:75 

RLTEMGACDARWATLAT R T F A Y N Y SEQ ID NO:76 
DGWTRKEGGI GLPWSVQCEDGYNY SEQ ID NO:77 
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-continued 

DSYPCHLL - - -- -- - -DV SEQIDNO:78 

VEYPI ADMCS----- - R Y SEQH)NO:79 

APELLGGPSVFVFPPKPKDVLSISGXPK SEQ ID NO:39 
APELPGGPSVFVFPTKPKDVLSISGRPK SEQ ID NO:40 
APELPOGPSVFVFPPKPKDVLSISGRPK SEQ ID NO:41 
APELLOGPSVF1FPPKPKDVLSISGRPK SEQ ID NO:42 
GQTREPQVYTLAPXRLEL SEQ ID NO:36 
GQPREPQVYTLPPSRDEL SEQ ID NO: 109 
GQPREPQVYTLPPSREEM SEQ ID NO; 110 
GQPREPQVYTLPPSQEEM SEQ ID NO:lll 

VTVSSGTNEVCKCPKCPAFELPGGPSVFVFP SEQ ID NO 43 

VWSSEPKIPQPQPKPQPQPQPQPKPQPKPEPHCICPKCPAPELIjGGPSVFIFP SEQ ID NO:44 
GTNEVCKCPKCP SEQ ID NO:37 
APELPGGPSVFVFP SEQ ID NO:45 

EPKIPQPQPKPQPQPQPQPKPQPKPEPECTCPKCP SEQ ID NO:38 
APELLGGPSVFIFP SEQ ID NO:46 



Such nucleotide sequences can be deduced from the 
amino-acid sequences taking into account the dcneneracy of 
the genetic code. They can be synthesized or isolated from 
cells producing immunoglobulins of the invention. 

A procedure for the obtention of such DNA sequences is 
described in the examples. 

The invention also contemplates RNA, especially mRNA 
sequences corresponding to these DNA sequences, and also 25 
corresponding cDNA sequences. 

The nucleotide sequences of the invention can further be 
used for the preparation of primers appropriate for the 
detection in cells or screening of DNA or cDNA libraries to 
isolate nucleotide sequences coding for immunoglobulins of 30 
the invention. 

Such nucleotide sequences can be used for the preparation 
of recombinant vectors and the expression of these 
sequences contained in the vectors by host cells especially 
prokaryotic cells like bacteria or also eukaryotic cells and 35 
for example CHO cells, insect cells, simian cells like Vero 
cells, or any other mammalian cells. Especially the fact that 
the immunoglobulins of the invention are devoid of light 
chains permits to secrete them in eukaryotic cells since there 
is no need to have recourse to the step consisting in the 40 
formation of the BIP protein which is required in the 
four-chain immunoglobulins. 

The inadequacies of the known methods for producing 
monoclonal antibodies or immunoglobulins by recombinant 
DNA technology comes from the necessity in the vast 45 
majority of cases to clone simultaneously the V w and V t 
domains corresponding to the specific binding site of 4 chain 
immunoglobulins. The animals and especially camelids 
which produce heavy-chain immunoglobulins according to 
the invention, and possibly other vertebrate species are 50 
capable of producing heavy-chain immunoglobulins of 
which the binding site is located exclusively in the V HH 
domain. Unlike the few heavy-chain immunoglobulins pro- 
duced in other species by chain separation or by direct 
cloning, the camelid heavy-chain immunoglobulins have 55 
undergone extensive maturation in vivo. Moreover their 
region has naturally evolved to function in absence of the 
V t . They are therefore ideal for producing monoclonal 
antibodies by recombinant DNA technology. As the obten- 
tion of specific antigen binding clones does not depend on a 60 
stochastic process necessitating a very large number of 
recombinant cells, this allows also a much more extensive 
examination of the repertoire. 

This can be done at the level of the non rearranged V HH 
repertoire using DNA derived from an arbitrarily chosen 65 
tissue or cell type or at the level of the rearranged 
repertoire, using DNA obtained from B lymphocytes. More 



interesting however is to transcribe the mRNA from anti- 
body producing cells and to clone the cDNA with or without 
prior amplification into an adequate vector. This will result 
in the obtention of antibodies which have already undergone 
affinity maturation. 

The examination of a large repertoire should prove to be 
particularly useful in the search for antibodies with catalytic 
activities. 

The invention thus provides libraries which can be gen- 
erated in a way which includes pan of the hinge sequence, 
the identification is simple as the hinge is directly attached 
to the V HH domain. 

These libraries can be obtained by cloning cDNA from 
lymphoid cells with or without prior PCR amplification. The 
PCR primers are located in the promoter, leader or frame- 
work sequences of the \ HH for the 5' primer and in the 
hinge, CH 2 , CHj, 3* untranslated region or poly A tail for the 
3' primer. A size selection of amplified material allows the 
construction of a library limited to heavy chain immuno- 
globulins. 

In a particular example, the following 3' primer in which 
a Kpnl site has been constructed and which corresponds to 
amino-acids 313 to 319 (CGC CAT CAA GGT AAC AGT 
TGA SEQ ID NO: 47) is used in conjunction with mouse 
V HW primers described by Sestry et al and containing a Xho 
site 

AG GTC CAG CTG CTC GAG TCT GG SEQ ID N0 48 
AG CTC CAG CTG CTC GAG TCT GG SEQ ID NO:49 
AG GTC CAG CTT CTC GAG TCT GG SEQ ID NO:50 
Xhol site 

These primers yield a library of camelid heavy chain 
immunoglobulins comprising the V HH region (related to 
mouse or human subgroup m). the hinge and a section of 
CH 2 . 

In another example, the cDNA is polyadenylated at its 5' 
end and the mouse specific V HH primers are replaced by a 
poly T primer with an inbuilt Xhol site, at the level of 
nucleotide 12. 
CTCGAGT 12 . 

The same 3' primer with a Kpnl site is used. 

This method generates a library containing all subgroups 
of immunoglobulins. 

Part of the interest in cloning a region encompassing the 
hinge-CH 2 link is that in both "ft and y3. a Sac site is present 
immediately after the hinge. This site allows the grafting of 
the sequence coding for the V HH and the hinge onto the Fc 
region of other immunoglobulins, in particular the human 
IgG, and IgG 3 which have the same amino acid sequence at 
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this site (Glu^g Leu 247 ). As an example, the invention 

contemplates a cDNA library composed of nucleotide cta gtg cac cac cat cac cat cac taa* tag* seo id no:52 

sequences coding for a heavy-chain immunoglobulin, such ac gtg gtg gta gtg gta gtg att a tcttaa 

as obtained by performing the following steps: seq id no:53 

a) treating a sample containing lymphoid cells, especially 5 

periferal. lymphocytes, spleen cells, lymph nodes or Hie vector containing the insert is then digested with Spel 

another lyphoid tissue from a healthy animal, especially and EcoRI to remove the resident "tag" sequence which can 

selected among the Camelids. m order to separate the be replaced by the poly-His/termination sequence. The pro- 

lymphoid cells. duced v „„ can equally be detected by using antibodies 

b) separating polyadenylated RNA from the other nucleic raised against the dromedary V ww regions. Under laboratory 
acids and components of the cells. conditions. V„„ regions are produced in the Immuno PBS 

c) reacting the obtained RNA with a reverse transcriptase in vector in mg amounts per liter. 

order to obtain the corresponding cDNA. The invention also relates to a DNA library composed of 

d) contacting the cDNA of step c) with 5' primers corre- nucleotide sequences coding for a heavy-chain 
spending to mouse W H domain of four-chain immunoglobulin, such as obtained from cells with rear- 
lmmunoglobulins. which primer contains a determined 15 ranged immunoglobulin genes. 

restriction site, for example an Xhol site and with 3' to a preferred embodiment of the invention, the library is 

primers corresponding to the N-terminal part of a C„2 prepared from cells from an animal previously immunized 

domain containing a Kpnl site. against a determined antigen. This allows the selection of 

e) amplifying the DNA, antibodies having a preselected specificity for the antigen 

f) cloning the amplified sequence in a vector, especially in 20 use <t f or immunization. 

a bluescript vector. In another embodiment of the invention, the amplification 

g) recovering the clones hybridizing with a probe corre- of the cDNA is not performed prior to the cloning of the 
spending to the sequence coding for a constant domain cDNA. 

from an isolated heavy-chain immunoglobulin. The heavy-chain of the four-chain immunoglobulins 

This cloning gives rise to clones containing DNA 25 remains sequestered in the cell by a chaperon protein (BIP) 

sequences including the sequence coding for the hinge. It until it has combined with a light chain. The binding site for 

thus permits the characterization of the subclass of the the chaperon protein is the C H 1 domain. As this domain is 

immunoglobulin and the SacI site useful for grafting the absent from the heavy chain immunoglobulins, their secre- 

FV^flh to the Fc region. tion is independent of the presence of the BIP protein or of 

The recovery of the sequences coding for the heavy-chain 30 the light chain. Moreover the inventors have shown that the 

immunoglobulins can also be achieved by the selection of obtained immunoglobulins are not sticky and accordingly 

clones containing DNA sequences having a size compatible will not abnormally aggregate. 

with the lack of the C„\ domain. The invention also relates to a process for the preparation 

It is possible according to another embodiment of the of a monoclonal antibody directed against a determined 

invention, to add the following steps between steps c) and d) 35 antigen, the antigen binding site of the antibody consisting 

of the above process: of heavy polypeptide chains and which antibody is further 

in the presence of a DNA polymerase and of deoxyribo- devoid of light polypeptide chains, which process comprises 

nucleotide triphosphates, contacting said cDNA with oli- immortalizing lymphocytes, obtained for example from the 

gonucleotide degenerated primers, which sequences are peripheral blood of Camelids previously immunized with 

capable of coding for the hinge region and N-terminal 4C a determined antigen, with an immortal cell and prefer- 

V HH domain of an immunoglobulin, the primers being ably with myeloma cells, in order to form a hybridoma, 

capable of hybridizing with the cDNA and capable of culturing the immortalized cells (hybridoma) formed and 

initiating the extension of a DNA sequence complemen- recovering the cells producing the antibodies having the 

tary to the cDNA used as template, desired specificity, 

recovering the amplified DNA. 45 The preparation of antibodies can also be performed 

The clones can be expressed in several types of expres- without a previous immunization of Camelids. 

sion vectors. As an example using a commercially available According to another process for the preparation of 

vector Immuno PBS (Huse et al: Science (1989) 246, 1275), antibodies, the recourse to the technique of the hybridoma 

clones produced in Bluescript® according to the above cell is not required. 

described procedure, are recovered by PCR using the same 50 According to such process, antibodies are prepared in 

Xhol containing 5' primer and a new 3' primer, correspond- vitro and they can be obtained by a process comprising the 

ing to residues 113-103 in the framework of the steps of: 

immunoglobulins, in which an Spe site has been con- cloning into vectors, especially into phages and more par- 

structed: TC TTA ACT AGT GAG GAG ACG GTG ACC ticularly filamentous bacteriophages, DNA or cDNA 

TG SEQ ID NO: 51. This procedure allows the cloning of 55 sequences obtained from lymphocytes especially PBLs of 

the V HH in the Xho Spe site of the Immuno PBS vector. Camelids previously immunized with determined 

However, the 3' end of the gene is not in phase with the antigens, 

identification "tag" and the stop codon of the vector. To transforming prokaryotic cells with the above vectors in 



achieve this, the construct is cut with Spe and the 4 base conditions allowing the production of the antibodies, 

overhangs are filled in, using the Klenow fragment after 60 selecting the antibodies for their heavy-chain structure and 
which the vector is religated. A further refinement consists further by subjecting them to antigen-afSnity selection, 
in replacing the marker ("tag") with a poly histidine so that recovering the antibodies having the desired specificity, 
metal purification of the cloned can be performed. To In another embodiment of the invention the cloning is 

achieve this a Spe/EcoRI double stranded oligonucleotide performed in vectors, especially into plasmids coding for 

coding for 6 histidines and a termination codon is first 65 bacterial membrane proteins. Procaryotic cells are then 
constructed by synthesis of both strands followed by heating transformed with the above vectors in conditions allowing 
and annealing: the expression of antibodies in their membrane. 
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The positive cells are further selected by antigen affinity 
selection. 

The heavy chain antibodies which do not contain the C H 1 
domain present a distinct advantage in this respect. Indeed, 
the C H 1 domain binds to BIP type chaperone proteins 
present within eukaryotic vectors and the heavy chains are 
not transported out of the endocytoplasmic reticulum unless 
light chains are present. This means that in eukaryotic cells, 
efficient cloning of 4-chain immunoglobulins in non mam- 
malian cells such as yeast cells can depend on the properties 
of the resident BIP type chaperone and can hence be very 
difficult to achieve. In this respect the heavy chain antibodies 
of the invention which lack the CH, domain present a 
distinctive advantage. 

In a preferred embodiment of the invention the cloning 
can be performed in yeast either for the production of 
antibodies or for the modification of the metabolism of the 
yeast. As example. Yep 52 vector can be used. This vector 
has the origin of replication (ORI) 2u of the yeast together 
with a selection marker Leu 2. 

The cloned gene is under the control of gall promoter and 
accordingly is inducible by galactose. Moreover, the expres- 
sion can be repressed by glucose which allows the obtention 
of very high concentration of cells before the induction. 

The cloning between BamHI and Sail sites using the same 
strategy of production of genes by PCR as the one described 
above, allows the cloning of camelid immunoglobulin genes 
in E. coli. As example of metabolic modulation which can 
be obtained by antibodies and proposed for the yeast, one 
can site the cloning of antibodies directed against cyclins, 
that is proteins involved in the regulation of the cellular 
cycle of the yeast (TIBS 16 430 J. D. McKinney, N. Heintz 
1991). Another example is the introduction by genetic 
engineering of an antibody directed against CD 28 , which 
antibody would be inducible (for instance by gall), within 
the genome of the yeast The CD 28 is involved at the level 
of the initiation of cell division, and therefore the expression 
of antibodies against this molecule would allow an efficient 
control of multiplication of the cells and the optimization of 
methods for the production in bioreactors or by means of 
immobilized cells. 

In yet another embodiment of the invention, the cloning 
vector is a plasmid or a eukaryotic virus vector and the cells 
to be transformed are eukaryotic cells, especially yeast cells, 
mammalian cells for example CHO cells or simian cells 
such as Veto cells, insect cells, plant cells, or protozoan 

For more details concerning the procedure to be applied 
in such a case, reference is made to the publication of Marks 
et al. J. Mol. Biol. 1991. 222: 581-597. 

Furthermore, starting from the immunoglobulins of the 
invention, or from fragments thereof, new immunoglobulins 
or derivatives can be prepared. 

Accordingly immunoglobulins replying to the above 
given definitions can be prepared against determined anti- 
gens. Especially the invention provides monoclonal or poly- 
clonal antibodies devoid of light polypeptide chains or 
antisera containing such antibodies and directed against 
determined antigens and for example against antigens of 
pathological agents such as bacteria, viruses or parasites. As 
example of antigens or antigenic determinants against which 
antibodies could be prepared, one can cite (he envelope 
glycoproteins of viruses or peptides thereof, such as the 
external envelope glycoprotein of a HIV virus, the surface 
antigen of the hepatitis B virus. 

Immunoglobulins of the invention can also be directed 
against a protein, hapten, carbohydrate or nucleic acid. 
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Particular antibodies according to the invention are 
directed against the galactosyla-l-3-galactose epitope. 

The immunoglobulins of the invention allow further the 
preparation of combined products such as the combination 

5 of the heavy-chain immunoglobulin or a fragment thereof 
with a toxin, an enzyme, a drug, a hormone. 

As example one can prepare the combination of a heavy- 
chain immunoglobulin bearing an antigen binding site rec- 
ognizing a myeloma immunoglobulin epitope with the abrin 
or mistletoe lectin toxin. Such a construct would have its 
uses in patient specific therapy. 

Another advantageous combination is that one can pre- 
pare between a heavy-chain immunoglobulins recognizing 
an insect gut antigen with a toxin specific for insects such as 
the toxins of the different serotypes of Bacillus thuringiensis 

15 or Bacillus sphaericus. Such a construct cloned into plants 
can be used to increase the specificity or the host range of 
existing bacterial toxins. 

The invention also proposes antibodies having different 
specificities on each heavy polypeptide chains. These 

20 multifunctional, especially bifunctional antibodies could be 
prepared by combining two heavy chains of immunoglobu- 
lins of the invention or one heavy chain of an immunoglo- 
bulin of the invention with a fragment of a four-chain model 
immunoglobulin. 

25 The invention also provides hetero- specific antibodies 
which can be used for the targetting of drugs or any 
biological substance like hormones. In particular they can be 
used to selectively target hormones or cytokines to a limited 
category of cells. Examples are a combination of a murine 
or human antibody raised against intcrleukin 2 (IL 2 ) and a 

3 heavy -chain antibody raised against CD 4 cells. This could be 
used to reactivate CD 4 cells which have lost their IL 2 

The heavy-chain immunoglobulins of the invention can 
also be used for the preparation of hetero- specific antibodies. 

35 These can be achieved either according to the above 
described method by reduction of the bridges between the 
different chains and reoxydation. according to the usual 
techniques, of two antibodies having different specificities, 
but it can also be achieved by serial cloning of two anti- 

40 bodies for instance in the Immuno pBS vector. 

In such a case, a first gene corresponding to the V HH 
domain comprised between Xho site and a Spe site is 
prepared as described above. A second gene is then prepared 
through an analogous way by using as 5' extremity a primer 

45 containing a Spe site, and as 3' extremity a primer containing 
a termination codon and an EcoRI site. The vector is then 
digested with EcoRI and Xhol and further both V HH genes 
are digested respectively by Xo Spe and by SpeEcoRI. 
After ligation, both immunoglobulin genes are serially 

50 cloned. The spacing between both genes can be increased by 
the introduction of addition codons within the 5' Spel primer. 

In a particular embodiment of the invention, the hinge 
region of IgG2 immunoglobulins according to the invention 
is semi-rigid and is thus appropriate for coupling proteins. In 

55 such an application proteins or peptides can be linked to 
various substances, especially to ligands through the hinge 
region used as spacer. Advantageously the fragment com- 
prises at least 6 amino acids. 

According to the invention it is interesting to use a 

60 sequence comprising a repeated sequence Pro-X. X being 
any arnino-acid and preferably Gin, Lys or Glu, especially a 
fragment composed of at least a 3-fold repeat and preferably 
of a 12-fold repeat, for coupling proteins to ligand, or for 
assembling different protein domains. 

65 The hinge region or a fragment thereof can also be used 
for coupling proteins to ligands or for assembling different 
protein domains. 
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Usual techniques for the coupling are appropriate and 
especially reference may be made to the technique of protein 
engineering by assembling cloned sequences. 

The antibodies according to this invention could be used 
as reagents for the diagnosis in vitro or by imaging tech- 
niques. The immunoglobulins of the invention could be 
labelled with radio-isotopes, chemical or enzymatic markers 
or chemiluminescent markers. 

As example and especially in the case of detection or 
observation with the immunoglobulins by imaging 
techniques, a label like technetium, especially technitium 99 
is advantageous. This label can be used for direct labelling 
by a coupling procedure with the immunoglobulins or 
fragments thereof or for indirect labelling after a step of 
preparation of a complex with the technitium. 

Other interesting radioactive labels are for instance 
indium and especially indium 111. or iodine, especially I 131 , 
I 125 and I 123 . 

For the description of these techniques reference is made 
to the FR patent application published under number 
2649488. 

In these applications the small size of the V HH fragment 
is a definitive advantage for penetration into tissue. 

The invention also concerns monoclonal antibodies react- 
ing with anti-idiotypes of the above-described antibodies. 

The invention also concerns cells or organisms in which 
heavy-chain immunoglobulins have been cloned. Such cells 
or organisms can be used for the purpose of producing 
heavy-chain immunoglobulins having a desired preselected 
specificity, or corresponding to a particular repertoire. They 
can also be produced for the purpose of modifying the 
metabolism of the cell which expresses them. In the case of 
modification of the metabolism of cells transformed with the 
sequences coding for heavy-chain immunoglobulins, these 
produced heavy-chain immunoglobulins are used like anti- 
sense DNA. Antisense DNA is usually involved in blocking 
the expression of certain genes such as for instance the 
variable surface antigen of trypanosomes or other patho- 
gens. Likewise, the production or the activity of certain 
proteins or enzymes could be inhibited by expressing anti- 
bodies against this protein or enzyme within the same cell. 

The invention also relates to a modified 4-chain immu- 
noglobulin or fragments thereof, the V tf regions of which 
has been partialy replaced by specific sequences or amino 
acids of heavy chain immunoglobulins, especially by 
sequences of the V NH domain. A particular modified V H 
domain of a four-chain immunoglobulin, is characterized in 
that the leucine, proline or glutamine in position 45 of the 
V H regions has been replaced by other amino acids and 
preferably by arginine, glutamic acid or cysteine. 

A further modified V H or \ L domain of a four-chain 
immunoglobulin, is characterized by linking of CDR loops 
together or to FW regions by the introduction of paired 
cysteines, the CDR region being selected between the CDR, 
and the CDR 3 , the FW region being the FW 2 region, and 
especially in which one of the cysteines introduced is in 
position 31, 33 of the CDR, or 45 of FW 2 and the other in 
CDR 3 . 

Especially the introduction of paired cysteines is such that 
the CDR 3 loop is linked to the FW2 or CDR1 domain and 
more especially the cysteine of the CDR3 of the V w is linked 
to a cysteine in position 3 1. 33 of the CDR. or in position 45 
of FW 2 . 

In another embodiment of the invention, plant cells can be 
modified by the heavy-chain immunoglobulins according to 
the invention, in order that they acquire new properties or 
increased properties. 
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The heavy-chain immunoglobulins of the invention can 
be used for gene therapy of cancer for instance by using 
antibodies directed against proteins present on the tumor 
cells. 

In such a case, the expression of one or two V H „ genes 
can be obtained by using vectors derived from parvo or 
adeno viruses. The parvo viruses are characterized by the 
fact that they are devoid of pathogenicity or almost not 
pathogenic for normal human cells and by the fact that they 
are capable of easily multiplying in cancer cells (Russel S. 
J. 1990. Immunol. Today H. 196-200). 

The heavy-chain immunoglobulins are for instance 
cloned within Hindin/Xbal sites of the infectious plasmid of 
the murine MVM virus (pMM984). (Merchlinsky et al, 
1983, J. Virol. 47, 227-232) and then placed under the 
15 control of the MVM38 promoter. 

The gene of the V HH domain is amplified by PCR by 
using a 5' primer containing an initiation codon and a 
HindlH site, the 3' primer containing a termination codon 
and a Xbal site. 
20 This construct is then inserted between positions 2650 
(HindHI) and 4067 (Xbal) within the plasmid. 

The efficiency of the cloning can be checked by transfec- 
tion. The vector containing the antibody is then introduced 
in permissive cells (NB-E) by transfeetion. 
25 The cells are recovered after two days and the presence of 
V HH regions is determined with an ELBA assay by using 
rabbit antiserum reacting with the \ HH part. 

The invention further allows the preparation of catalytic 
antibodies through different ways. The production of anti- 
30 bodies directed against components mimicking activated 
states of substrates (as example vanadate as component 
rnimicking the activated state of phosphate in order to 
produce their phosphoesterase activities, phosphonate as 
compound mimicking the peptidic binding in order to pro- 
35 duce proteases) permits to obtain antibodies having a cata- 
lytic function. Another way to obtain such antibodies con- 
sists in performing a random mutagenesis in clones of 
antibodies for example by PCR. in introducing abnormal 
bases during the amplification of clones. These amplified 
40 fragments obtained by PCR are then introduced within an 
appropriate vector for cloning. Their expression at the 
surface of the bacteria permits the detection by the substrate 
of clones having the enzymatic activity. These two 
approaches can of course be combined. Finally, on the basis 
43 of the data available on the structure, for example the data 
obtained by XRay crystallography or NMR. the modifica- 
tions can be directed. These modifications can be performed 
by usual techniques of genetic engineering or by complete 
synthesis. One advantage of the V HH of the heavy chain 
50 immunoglobulins of the invention is the fact that they are 
sufficiently soluble. 

The heavy chain immunoglobulins of the invention can 
further be produced in plant cells, especially in transgenics 
plants. As example the heavy chain immunoglobulins can be 
55 produced in plants using the pMon530 plasmid (Roger et al. 
Meth Enzym 153 1566 1987) constitutive plant expression 
vector as has been described for classical four chain anti- 
bodies (Hiat et al. Nature 342 76-78. 1989) once again using 
the appropriate PCR primers as described above, to generate 
60 a DNA fragment in the right phase. 

Other advantages and characteristics of the invention will 
become apparent in the examples and figures which follow. 
FIGURES 

65 FIG. 1 Characterisation and purification of camel IgG by 
affinity chromatography on Protein A and Protein G 
sepharose (Pharmacia). 
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(A) shows, after reduction, the SDS-PAGE protein profile of 
the adsorbed and non adsorbed fractions of Camelus drom- 
edarius serum. The fraction adsorbed on Protein A and 
eluted with NaCl 0.15M acetic acid 0.58% show upon 
reduction (lane c) three heavy chain components of respec- 5 
tively 50. 46 and 43 Kd and light chain (rabbit IgG in lane 
a). The fractions adsorbed on a Protein G Sepharose 
(Pharmacia) derivative which has been engineered to delete 
the albumin binding region (lane e) and eluted with 0.1M gly 
HC1 pH 2.7 lacks the 46 Kd heavy chain which is recovered 

in the non adsorbed fraction (lane f). None of these com- 
ponents are present in the fraction non adsorbed on Protein 
A (lane d), lane b contains the molecular weight markers. 

(B) and (C) By differential elution. immunoglobulin frac- 
tions containing the 50 and 43 Kd heavy chain can be 
separated. 5 ml of C. dromadarius serum is adsorbed onto a 13 
5 ml Protein G sepharose column and the column is exten- 
sively washed with 20 mM phosphate buffer. pH 7.0. Upon 
elution with pH 3.5 buffer (0.15M NaCl, 0.58% acetic acid) 

a 100 Kd component is eluted which upon reduction yields 
a 43 Kd heavy chain, (lane 1). After column eluant absor- 20 
bance has fallen to background level a second immunoglo- 
bulin component of 170 Kd can be eluted with pH 2.7 buffer 
(0.1M glycine HC). This fraction upon reduction yields a 50 
Kd heavy chain and a board light chain band (lane 2). The 
fraction non adsorbed on Protein G is then brought on a 5 ml 25 
Protein A Sepharose column. After washing and elution with 
pH 3.5 buffer (0.15M NaCl. 0.58% acetic acid) a third 
immunoglobulin of 100 Kd is obtained which consists solely 
of 46 Kd heavy chains (lane 3). 

FIG. 2: Immunoglobulins of Camelus bactrianus, Lama 30 
vicugna, Lama glama and Lama pacos to Protein A (A lanes) 
and to Protein G (G lanes) analyzed on SDS-PAGE before 
(A) and after reduction (B). 

10 ul of serum obtained from the different species wera 
added to Eppendorf" tubes containing 10 mg of Protein A or 35 
Protein G sepharose suspended in 400 ul of pH 8.3 immu- 
noprecipitation buffer (NaCl O.2.M. Tris 0.01M; EDTA 
0.01M. Triton X100 1%, ovalbumin 0.1%). The tubes were 
slowly rotated for 2 hours at 4° C. After centrifugation the 
pellets were washed 3 times in buffer and once in buffer in 40 
which the Triton and ovalbumin had been ommitted. The 
pellets were then resuspended in the SDS-PAGE sample 
solution 70 ul per pellet with or without dithiotreitol as 
reductant. After boiling for 3 min at 100° C, the tubes were 
centrifuged and the supernatants analysed, 45 

In all species examined the unreduced fractions (A) 
contain in addition to molecules of approximately 170 Kd 
also smaller major components of approximately 100 Kd. In 
the reduced sample (B) the constituant heavy and light 
chains are detected. In all species a heavy chain component 50 
(marked by an asterisk *) is present in the material eluted 
from the Protein A but absent in the material eluted from the 
Protein G. 

FIG. 3 IgGj, IgG 2 and IgG 3 were prepared from serum 
obtained from healthy or Trypanosoma evansi infected Cam- 55 
elus dromedarius (CATT titer Vieo (3) and analysed by 
radioimmunopreci- pitation or Western Blotting for and 
trypanosome activity. 

(A) 35 S methionine labelled Trypanosome evansi antigens 
lysate (500.000 counts) was added to Eppendorf tubes 60 
containing 10 ul of serum or, 20 ug of IgG^ IgG 2 or IgG 3 
in 200 ul of pH 8.3 immunoprecipitation buffer containing 
0.1M TLCK as proteinase inhibitor and slowly rotated at 4° 
C. during one hour. The tubes were then supplemented with 
10 mg of Protein A Sepharose suspended in 200 ul of the 65 
same pH 8.3 buffer and incubated at 4° C. for an additional 
hour. 
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After washing and centrifugation at 15000 rpm for 12 s, 
each pellet was resuspended in 75 ul SDS-PAGE sample 
solution containing BTT and heated for 3 min. at 100° C. 
After centrifugation in an Eppendorf minifuge at 15000 rpm 
for 30 s, 5 ul of the supernatant was saved for radioactivity 
determination and the reminder analysed by SDS-PAGE and 
fluorography. The counts/5 ul sample are inscribed on for 
each line. 

(B) 20 ug of IgGj, IgG 2 and IgG 3 from healthy and 
trypanosome infected animals were separated by SDS- 
PAGE without prior reduction or heating. The separated 
samples were then electro transferred to a nitrocellulose 
membrane, one part of the membrane was stained with 
Ponceau Red to localise the protein material and the 
reminder incubated with 1% ovalbumin in TST buffer (Tris 
10 mM. NaCl 150 mM. Tween 0.05%) to block protein 
binding sites. 

After blocking, the membrane was extensively washed 
with TST buffer and incubated for 2 hours with 35 S-labelled 
trypanosome antigen. After extensive washing, the mem- 
brane was dried and analysed by autoradiography. To avoid 
background and unspeciflc binding, the labelled trypano- 
some lysate was filtered through a 45u millipore filter and 
incubated with healthy camel immunoglobulin and ovalbu- 
min adsorbed on a nitrocellulose membrane. 

FIG. 4: Purified IgG3 of fee camel, by affinity chroma- 
tography on Protein A Sepharose are partially digested with 
papain and separated on Protein A sepharose. 

14 mg of purified IgG3 were dissolved in 0. 1M phosphate 
buffer pH 7.0 containing 2mM EDTA. They were digested 
by 1 hour incubation at 37° C. with mercurypapain (1% 
enzyme to protein ratio) activated by 5.10 4 M cysteine. The 
digestion was blocked by fee addition of excess iodoaceta- 
mide (4.10 2 M)(13). After centrifugation of the digest in an 
ependorf centrifuge for 5 min at 15000 rpm. the papain 
fragments were separated on a protein A Sepharose column 
into binding (B) and non binding (NB) fractions. The 
binding fraction was eluted from the column with 0.1M 
glycine HC1 buffer pH 1.7. 

FIG. 5: Schematic presentation of a model for IgG3 
molecules devoid of light chains. 

FIG. 6: 

Schematic representation of immunoglobulins having heavy 
polypeptide chains and devoid of light chains, regarding 
conventional four-chain model immunoglobulin. 
Representation of a hinge region. 

FIG. 7: Alignment of 17 V HH DNA sequences of Camel 
heavy chain immunoglobulins SEQ H> NO: 92-108 

FIG. 8: Expression and purification of the camel V HH 21 
protein from E. coli. 

I HEAVY CHAIN ANTIBODIES IN CAMELIDS 
When Camelus dromedarius serum is adsorbed on Protein 
G sepharose. an appreciable amount (25-35%) of immuno- 
globulins (Ig) remains in solution which can then be recov- 
ered by affinity chromatography on Protein A sepharose 
(FIG. 1A). The fraction adsorbed on Protein G can be 
differentially eluted into a tightly bound fraction (25%) 
consisting of molecules of an unreduced apparent molecular 
weight (MW) of 170 Kd and a more weakly bound fraction 
(30-45%) having an apparent molecular weight of 100 Kd 
(FIG. IB). The 170 Kd component when reduced yields 50 
Kd heavy chains and large 30 Kd light chains. The 100 Kd 
fraction is totally devoid of light chains and appears to be 
solely composed of heavy chains which after reduction have 
on apparent MW of 43 Kd (FIG. 1C). The fraction which 
does not bind to Protein G can be affinity purified and eluted 
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from a Protein A column as a second 100 Kd component In contrast, the I.E.F. pattern of IgG x is completely 

which after reduction appears to be composed solely of 46 modified after reduction as the isoelectric point of each 

Kd heavy chains. molecule is determined by the combination of the isoelectric 

The heavy chain immoglobulins devoid of light chains points of the light and heavy chains which after separation 
total up to 75% of the molecules binding to Protein A. 5 will each migrate to a different position. 

As all three immunoglobulins bind to Protein A we refer n^e findings indicate that the heavy chains alone can 

< f^t • ^ T^WlW ^ and , heav y <*2l V generate an extensive repertoire and question the contribu- 
(50 Kd) binding to Protein G, IgG 2 (heavy chain 72 (46 Kd) 
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sub(classes) can be further subdivided. p y ' 

A comparative study of old world camelids (Camelus Normally, isolated heavy chain from mammalian immu- 

bactrianus and Camelus dromedarius) and new world cam- noglobuhns tend to aggregate considerably but are only 

elids {lama pacos, lama gtama, lama vicugna) showed that solubilized by light chains (8. 9) which bind to the C w l 

heavy chain immunoglobulins are found in all species 15 domain of the heavy chain. 

examined, albeit with minor differences in apparent molecu- In humans and in mice a number of spontaneous or 

lar weight and proportion. The new world camelids differs induced myelomas produce a pathological immunoglobulin 

from the old world carelids in having a larger IgG 3 molecule solely composed of heavy chains (heavy chain disease), 

(heavy chain immunoglobulin binding to Protein G) in These myeloma protein heavy chains carry deletions in the 

which the constituant heavy chains have an apparent C„l and V WH domains (10). The reason why full length 

molecular weight of 47 Kd (FIG. 2). 20 heavy chaijls do not give rise t0 sec reted heavy chain in such 

The abundance of the heavy chain immunoglobulins in pathological immunoglobulins seems to stem from the fact 

the serum of camelids raises the question of what their role that the synthesis of Ig involves a chaperoning protein, the 

is in the immune response and in particular whether they immunoglobulin heavy chain binding protein or BIP (11). 

bear antigen binding specificity and if so how extensive is which normally is replaced by the light chain (12) It is 

the repertoire. This question could be answered by exam- 25 possible that the primordial role of the light chain in the 

lmng the immunoglobulins from Trypanosoma evansi four-chain model immunoglobulins is that of a committed 

infected camels (Camelus dromedarius). heavy chain chaperon and that the emergence of light chain 

For this purpose, the corresponding fractions of IgGi. repertoires has just been an evolutionary bonus. 

IgG 2 . IgG 3 were prepared from the serum of a healthy camel The camelid y2 and v3 chains are considerably shorter 

and from the serum of camels with a high antitrypanosome than the normal mammalian y chain. This would suggest that 

titer, measured by the Card Agglutination Test (3). In radio- deletions have occurred in the C„ 1 domain. Differences in 

unmunoprecipitation. IgG^ IgG 2 and IgG 3 derived from si2es of me y2 and y3 immunoglobulins of old and new 

infected camel indicating extensive repertoire heterogeneity world came iids suggests that deletions occurred in several 

and complexity (FIG. 3A) were shown to bind a large evolutionary steps especially in the C„l domain, 
number of antigens present in a J5 S methionine labelled 

trypanosome lysate. H THE HEAVY CHAIN IMMUNOGLOBULINS 

In blotting experiments 35 S methionine labelled trypano- 0F l Hii CAMELIDS LACK THE C„l DOMAIN 

some lysate binds to SDS PAGE separated IgGj. IgG 2 and The strategy followed for investigating the heavy chain 

IgG 3 obtained from infected animals (FIG. 3B). 4q immunoglobulin primary structure is a combination of pro- 

This leads us to conclude that the camelid heavy chain tein and cDNA sequencing; the protein sequencing is nec- 

IgG 2 and IgG 3 are bona fide antigen binding antibodies. essary to identify sequence stretches characteristic of each 

An immunological paradigm states that an extensive immunoglobulin. The N-terminal of the immunoglobulin 

antibody repertoire is generated by the combination of the being derived from the heavy chain variable region reper- 

light and heavy chain variable V region repertoires (6). The 45 toire only yields information on the V HH subgroups 

heavy chain immunoglobulins of the camel seem to contra- (variable region of the heavy chain) and cannot be used for 

diet this paradigm. class or subclass identification. This means that sequence 

Immunoglobulins are characterized by a complex LE.F. nad to De obtained from internal enzymatic or chemical 

(isoelectric focussing) pattern reflecting their extreme het- cleavage sites. 

erogeneity. To determine whether the two heavy chains 50 A combination of papain digestion and Protein A affinity 

constituting the IgG 2 and IgG 3 are identical or not, the chromatography allowed the separation of various frag- 

isoelectric focussing (LE.F.) pattern were observed before ments yielding information on the general structure of IgG 3 

and after chain separation by reduction and alkylation using The IgG3 of the camel (Camelus dromedarius) purified by 

iodoacetamide as alkylating agent. affinity chromatography on Protein A Sepharose were par- 

As this alkylating agent does not introduce additional 55 tially digested with papain and the digest was separated on 

charges in the molecule, the monomers resulting from the Protein A Sepharose into binding and non binding fractions, 

reduction and alkylation of a heavy chain homodimer will These fractions were analysed by SDS PAGE under reduc- 

have practically the same isolectric point as the dimer, ing and non reducing conditions (FIG. 4). 

whereas if they are derived from a heavy chain heterodimer. The bound fraction contained two components, one of 28 

the monomers will in most cases differ sufficiently in iso- 60 Kd and one of 14.4 Kd. in addition to uncleaved or partially 

electric point to generate a different pattern in I.E.F. cleaved material. They were well separated by gel electro- 

Upon reduction, and alkylation by iodoacetamide the phoresis (from preparative 19% SDS-PAGE gels) under non 

observed pattern is not modified for the Camelus drom- reducing conditions and were further purified by electroelu- 

edarius IgG 2 and IgG 3 indicating that these molecules are tion (in 50 nM amonium bicarbonate, 0.1% (w/v) SDS using 

each composed of two identical heavy chains which migrate 65 a BioRad electro-eluter). After lyophilization of these elec- 

to the same position as the unreduced molecule they origi- troeluted fractions, the remaining SDS was eliminated by 

nated from. precipitating the protein by the addition of 90% ethanol. 
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mixing and incubating the mixture overnight at -20° C. (14). 94° C. (denature) 1 min at 54° C. (anneal). 2 min at 72° C. 

The precipitated protein was collected in a pellet by centri- (elongate). To amplify DNA sequences between V HH and 

fuging (15000 rpm, 5 min) and was used for protein C H 2 domains. (#72 clones), the PCR was performed in the 

sequencing. N-terminal sequencing was performed using the same conditions with the exception that the annealing tem- 

automated Edman chemistry of an Applied Biosystem 477A 5 perature was increased to 60° C. 

pulsed liquid protein sequencer. Amino acids were identified One clone examined (#56/36) had a sequence correspond- 
as their phenylthiohydantoin (PTH) derivatives using an j ng to th e N-terminal part of a C H 2 domain identical to the 
Applied Biosystem 120 FTH analyser. All chemical and sequence of the 28 Kd fragment. The availability of this 
reagents were purchased from Applied Biosystems. Analysis sequence data allowed the construction of an exact 3' primer 
of the chromatographic data was performed using Applied 10 an d the cloning of the region between the N-terminal end of 
Biosystems software version 1.61. In every case the com- the V HH and the C H 2 domain. 

puter aided sequence analysis was confirmed by direct 5 , primers corTesponding t0 me mouse y m (16) and 

inspection of the chromatograms from the PTH analyser. containing a Xhol restriction site were used in conjunction 

Samples for protein sequencing were dissolved in either wift fte 3, in whkn a site had ^ mserted mi 

50% (v/v) trifluoroacetic acid (TFA) (28 Kd fragment) or 15 ^ amplified sequences were cloned into pBluescript*. 

100% TFA (14 Kd fragment). Samples of dissolved protein clone #56/36 whkh ^^yed two internal Haem sites was 

equivalent to 2000 pmol (28 Kd fragment) or 500 pmol (14 ^ ^ ^ ^ me tQ duce a probe (0 identify 

Kd fragment) were applied to TFA-treated glass fibre discs. pCR posing c i ones 

The glass fibre discs were coated with BioBrene (3 mg) and ^ ffication me rcR oducts wc chccked on „ 

precycled once before use. 20 L2% (w ^ agar0 se gel. Cleaning up of the PCR products 

N-terrmnal sequencing of the 28 Kd fragment yields a included a phenol-chloroform extractio followed by further 

sequence homologous to the N-terminal part of 7 C H 2 vwma&(XL by HPLC (OEN-PAC FAX column. Waters) and 

domain and hence to the N-terminal end of the Fc fragment. q - fc usin ^ MERMAID or GENECLEAN R kit. 

The N-terminal sequence of the 14.4 Kd fragment corre- MQ mhi c)K appropriate. After these purification steps, 

sponds to the last lysine of a 7 C„2 and the N-tenrunal end 25 ^ ^ cDNA wfls kD # ted wi(h EcoMandRpnl 

of a y C«3 domain (Table 1). The molecular weight MW) f ^ ^ ^ ^ ^ ^ ^ ^ ^ #?2 

of the papain fragments and the identification of their clones A ^ phenol . chloroform extraction preceded the 

N-terminal sequences led us to conclude that the C„2 and u tion intQ uc 18(series=56 clones) OT in to pBluescript* 

C«3 domains of the y 3 heavy chains are normal in size and ( ser ies=72 clones) 

that the deletion must occur either in the C„l or in the \ HH 30 , , ' . . „ <. L . 

doniaintogeneratetheshcrtedy3chain.lliefractionswhich AU the clones obtained were smaller that the 860 base 

do not bind to Protein A Sepharose contain two bands of 34 ?™ s lo be * *** P os f f ed a com P^ X V »» «? d 

and 17 Kd which are more diffuse is SDS PAGE indicating C„l region Partial sequence data corresponding to the 

that they originate from the variable N-terminal part of the N-terminal of the V regxon reveals hat out of 20 clones, 

molecule (FIG 4) 35 3 were identical and possibly not independent. The 

\ . . , . . ttnvA . t , sequences obtained ressemble the human subgroup m and 

Lpon reduction, a single diffuse band of 17 Kd is found ^ murine su ma ^ * 

indicating that the 34 Kd is a disulfide bonded dimer of the & v ' 

17 Kd coniponent. The 34 Kd fragment apparently contains Clones corresponding to two different sets of C„2 protein 

the hinge and the N-terminal doinain V„„. * f en « s were obtained A first set of sequences (=72/41) 

^, e . , . . . _ _ 40 had a N-terminal C„2 region identical to the one obtained by 

^ss 8 ssl^ iw^^aSS r d ^ n ^ fthc 28 Kdpap r ^rr "* 1 

Jon of cDNA or genomicW ^ Y^^S^TSESS^ 

It has been shown that the cells from camel spleen imprint residucs encoded by (hc j Agones adjoining the hinge, 

cells reacted with rabbit and anti camel immunoglobulin 45 xh e q \ domain is entirely lacking. This cDNA corresponds 

sera and that the spleen was hence a site of synthesis of at tQ me ^ (Table 4) 

least one immunoglobulin class. cDNA was therefore syn- , ... /u~r*>n\ .u 1: 

thetised from camel spleen mRNA. The conditions for die plated sequence (#72/1) the proline in 

isolation of RNA were the following: total RNAwas isolated P° sitwn 259 15 re P laced * threonme - 

from the dromedary spleen by the guanidium isothiocyanate „ The sequence corresponding to the C„3 and the remaining 

method (15). mRNA was purified with oligo T-paramagnetic part of the C„2 was obtained by PCR of the CDNA using as 

beads Kpnl primer a poly T in which a Kpnl restriction site had 

cDNA synthesis is obtained using 1 ug mRNA template. «*« ****** at * c 5 ' end " ^ to ^ ^ Ce h ° f ' he 1? chai ^ 

an oligodT primer and reverse transcriptase corresponds to .a molecular weighty (MW) wfech is in good 

(BOERfflNGER MAN). Second strand CDNA is obtained 55 jeement wxth the data obtamed from SDS PAGE electro- 



using RNAse H and E. coli DNA polymerase I according tc 



the condition given by the supplier. The sequence of this y3 chain presents similarities with 

Relevant sequences were amplified by PCR: 5 ng of °* er J e * ce P l mat ***** me C » ! domain ' * e 

cDNA was amplified by PCR in a 100 pi reaction mixture domaul ^ ad J accnt to me m,l 8 e - 

(10 mM Tris-HCl P H 83. 50 mM KC1. 15 mM MgCl 2 , 60 One or aU three of the cysteines could be probably 

0.01 % (w/v) gelatine. 200 uM of each dNTP and 25 pmoles responsible for holding the two y3 chains together, 

of each primer) overlaid with mineral oil (Sigma). These results have allowed us to define a model for the 

Degenerate primers containing EcoRI and Kpnl sites and IgG 3 molecule based on sequence and papain cleavage (FIG. 

further cloned into pUC 18. After a round of denaturing and 5). 

annealing (94° C. for 5 min and 54° C. for 5 min), 2 units 65 Papain can cleave the molecule on each side of the hinge 

of Taq DNA polymerase were added to the reaction mixture disulfides and also between C„2 and C H 3. Under non 

before subjecting it to 35 cycles of amplification: 1 min at reducing conditions the \ HH domains of IgG 3 can be iso- 
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lated as disulfide linked dimer or as monomer depending on of the order of 80 A (2V HH ). This could be a severe 

the site of papain cleavage. limitation for agglutinating or cross linking. In the IgG 2 this 

A second set of clones #72/29 had a slightly different is compensated by the extremely long stretch of hinge, 

sequence for the C H 2 and was characterized by a very long composed of a 12-fold repeat of the sequence Pro-X (where 

hinge immediately preceded by the variable domain. This 5 X is Gin. Lys or Glu) and located N-terminal to the hinge 

hinge region has 3 cysteines at its C-terminal end in a Sulfide bridges. In contrast, in the human IgG3. the very 

sequence homologeous to the y3 hinge. Such second set of loa 8 w , hich . apparently arose as the result of 

clones could represent the IgG 2 subclass. For the constant * e f ence ^P^™ d ° es not contribute to increase the 

part of the ,3 and also for the .putative y2. most Cones are %££^^£&* "** 35 *** » 

identical showing the 72 or y3 specific sequences. A few to „ r . , . * , , 

clones such as #72/1 however show minor differences. For ^^l^t^^t^^^i^^' 

instance in the case of clones #72/1 two nucleotide differ- ^™J y fOtatl0nal freed ° m ° f the bindlng Slte VerSUS 0,6 Fc 

ences are detected. ' . . . . .. . , ... ^ 

, , Unhke myeloma heavy chains which result probably from 

Several regions cDNA s have now been totally or Cwl deletion in a single antibody producing cell, or heavy 

partially sequenced with the exception of a short stretch at 15 cha in antibodies produced by expression cloning (15); the 

the N-terminal end which is primer derived. camelid heavy chain antibodies (devoid of light chains) have 

Upon translation the majority shows the characteristic emerged in a normal immunological environment and it is 

heavy chain Ser 21 Cys 22 and Tyr^ Tyr 91 CySg 2 sequences, expected that they will have undergone the selective refine- 

of the intra V HH region disulfide bridge linking residues 22 ment in specificity and affinity accompanying B cell matu- 

and 92. All these clones have a sequence corresponding to 20 ration. 

the framework 4 (FR4) residues of the variable region Expression and purification of the camel V ww 21 (DR21 on 

immediately preceding the postulated hinge sequence (Table F* 0 - 7 > P rotein fr° m E - coli 

3). This sequence is generated by the J minigenes and is in ^ cIones 0311 te expressed in several types of expres- 

the majority of cases similar to the sequence encoded by the slon vectors - As an example using a commercially available 

human and murine J minigenes. The sequence length 23 vertor Immuno PBS (Huse etal: Science (1989) 246. 1275). 

between region Cys 92 and the C-terminal end of the V HH clones P roduced in Bluescript® according to the above 

regions is variable and, in the sequences determined, range described procedure, have been recovered by PCR using the 

from 25 to 37 amino-acids as one might expect from the sarae Xho1 containin § s Primer and a new 3' primer, 

rearrangements of J and D minigenes varying in length. corresponding to residues 113-103 in the framework of the 

Several important questions are raised by the sole exist- 30 ~°^?5a a^a^ rJ^r^rr rTr Irr 

ence of these heavy chain immunoglobulins in a non patho- £^ ™ ™ ? ^ A ° T a u ?2 ^ ACC f 

logical situation. First of all. are dfey bonafide antibodies? ™ J ^T±^unomsTJto 

^SS^^^^J^ Ho^ «he y end of the gene was not in phase with the 

antigens as shown in part I of these^xamples. This implies 35 fl^h ih § "S P t ^ Y^TJ 0 

that the camelid immune system generates an extensive adu ? Ve ^ C ™T WaS W '* Spe l" d &e 4 base 

number of binding sites composed of single V„„ dornTs °™*T T r Tf ^ 

This is confirmed by the diversity of the V„„ regions of the wtud. the vector was religated. 

heavy chain immunogobulins obtained by KR ™ e "P* 8 " 0 ? vector plasmid ipBS (immunopBS) 

The second question is "how are they Secreted?". The *° V"? ^ M » USed 



.. . . , . .. . .. - . , for immunoglobulin chain expression in E. coli under the 

secxeuon of immunoglobulin heavy chains composing four- promo{Qr LAC con(Iol a ribosome bindin ^ and 

chain model ^immunoglobulins does not occur under normal codolls . ^ addition ft conlains , s £ for fl c . tennini £ 

conditions. A chaperoning protem. the heavy chain binding decapeptide tag 

protein or B1P protein, prevents heavy chains from being 45 E coli Mm harborill g ^ ipB S-V„„21 plasmid was 
secreted. It is only when the fight chain displaces the BIP ^ , , 0 f TB medium with 100 ug/ml ampicillin and 

protein in the endoplasmic reticulum that secretion can QA% glucose at 32 o c Zs indeed by the 

0C ™ r V addition of 1 mM IPTG (final concentration) at an 0D S5O of 

The heavy chain dimer found in the serum of human or i, 0 . After overnight induction at 28° C the cells were 

mice with the so-called "heavy chain disease" lack the C„ 1 50 harvested by centrifugation at 4.000 g for 10 min (4° C.) and 

domains thought to harbour the BIP site (14). In the absence resuspended in 10 ml TES buffer (0.2M Tris-HCL pH 8.0, 

of thi domain the BIPprotein can no longer bind and prevent 0 .5 mM EDTA. 0.5M sucrose). The suspension was kept on 

the transport of the heavy chains. ice for 2 hours . Periplasmic proteins were removed by 

The presence in camels of a IgGl class composed of osmotic shock by addition of 20 ml TES buffer diluted 1:4 

heavy and light chains making up between 25% and 50% of 55 v/v with water, kept on ice for one hour and subsequently 

the total IgG molecules also raises the problem as to how centrifugated at 12.000 g for 30 min. at 4° C. The superna- 

maturation and class switching occurs and what the role of tant periplasmic fraction was dialysed against Tris-HCl pH 

the light chain is. The camelid light chain appears unusually 8.8, NaCl 50 mM. applied on a fast Q Sepharose flow 

large and heterogeneous when examined in SDS PAGE. (Pharmacia) column, washed with the above buffer prior and 

The largest dimension of an isolated domain is 40 A and 60 eluted with a linear gradient of 50 mM to 1M NaCl in buffer, 
the maximum attainable span between binding sites of a Fractions containing the V HH protein were further purified 

conventional IgG with C H \ and V HH will be of the order of on a Superdex 75 column (Pharmacia) equilibrated with 

160 A (2V WW +2C H 1) (19). The deletion of C„l domain in PBS buffer (0.01M phosphate pH 7.2. 0.15M NaCl). The 

the two types of heavy chain antibodies devoid of light yield of purified VHI protein varies from 2 to 5 mg/1 cell 

chains, already sequenced has, as a result, a modification of 65 culture. 

this maximum span (FIG. 6). In the IgG3 the extreme Fractions were analyzed by SDS-PAGE(I). Positive iden- 

distance between the extremities of the \ HH regions will be tification of the camel \ HH antibody fragment was done by 
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Western Blot analysis using antibody raised in rabbits Sepharose column chromatography (CEluted at 650 mM 

against purified camel IgGH 3 and an anti-rabbit IgG-alkaline NaCl Dduted at 700 mM NaCl) Effractions from Super- 

phosphatase conjugate (II). dex 75 column chromatography. 

As protein standards (Pharmacia) periplasmic proteins As can be seen, the major impurity is eliminated by 

prepared from 1 ml of HTG-induced JMlOl/ipBS V w „21 5 ionexchange chromatography and the bulk of the remaining 

were used. FIG. 8 shows: C.D:fractions from fast S impurities are eliminated by gel filtration. 



Comparison of the N terminal Camel C H 2 and C H 3 sequences with the translated 
cDNA sequences of Camel immunoglobulins and with the corresponding human y sequer 
(Numbering according to Kabat et al (1987X7). 



Clone 
Clone 



250 



260 



J F P f 



' K D V L S 



P T K P K D 
-LPGGPSVFVFPPKPKDVLS 
-LLGGPSVFIFPPKPKDVLS 
-LLGGPS VFLF PPKPKDTLM 
-VA-GPS VFLF PPKPKDTLM 
-FLGGPS VFLF PPKPKDTLM 
C«2 | C„3 



S G R P SEQIDNO:56 - 

S G R P SEQIDNO:57 - 

S R T P SEQIDNO:112 - 

S R T P SEQIDNO:113 - 

S R T P SEQIDNO:114 - 



360 



370 



-KlGQTREPQVYTLAPXRLEL SEQIDNO:54 - 
- KOQP REPQVYTLPP S RDBL SEQIDNO:115 - 
-KGQPREPQVYTLPPSREEM SEQE>NO:116 - 
-K|GQPREPQVYTLPPSQEEM SEQIDNO:117 - 



30 



in V H m subgroup protein and a mouse V H DIA subgroup protein. 
The invariable subgroup specific residues are grayed. 

10 20 30 

Primer Derived GGSVQTGGSLRLSCEI SGLTFD #72/4 SEQIDNO:l 

GGSVQTGGSLRLSCAVSGFSFS #72/3 SEQ ID NO:2 

GGSEQGGGSLRLSCAI SGYTYG #72/7 SEQ ID NO:3 

GGSVQPGGSLTLSCTVS GATYS #72/17 SEQ ID NO:4 

GGSVQAGGSLRLSCTGSGFPYS #72718 SEQ ID NO:5 

DVQLVAS GGGS VGAGGS L R L S CTAS GDS F S #72/2 SEQ ID NO:58 

EVKLVES GGGLVEPGGS LRLS CATS GF TF S Mouse V H nU SEQIDNO:118 

EVQLLSGGGLVQPGGSLRLSCAASGFTFS Human V»m SEQIDNO:119 



TABLE 3 

Comparison of some Framework 4 residues found in the Camel region with the Framework 4 
residues corresponding to the consensus region of the Human and Mouse J minigenes. 

FrameWork 4 I Genes 

Human WGQGTLVTVS S SEQIDNO:9 J1,J4,J5 

WGRGTLVTVS S SEQIDNO:130 J2 

WGQGTTVTVS S SEQIDNO:120 16 

WGQGTMVTVS S SEQIDNO:121 33 
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TABLE 3-continued 



WGQGTTL TVS S SEQIDNO:122 

WGQGTLVTVS S SEQIDNO:9 

WGQGTSVTVSS SEQIDNO:123 

WGAGTTVTVS S SEQIDNO:124 



cDNA Clones 



WGQGTQVTVSS SEQIDNO:8 Clones 

WGQGTQVTVSS SEQ ID NO:8 #72/19 = #?2/3 

WGQGTLVTVS S SEQ ID NO:9 1 Clone 

WGRGTQVTVS S SEQ ID NO:59 #72/24 

WGQGTHVTVS S SEQIDNO:60 #72/21 

WGQGI QVTASS SEQIDNO:61 #72/16 



25 



24 r 1 t 

25 d g * 



i W G A ' 
105 



'TVS S SEQ1DN0:12; Mouse V» ;/J i 



cl WGQGTQVTVSS SEQE>NO:13 

gc RGQGTQVTVSL SEQH>NO:14 

ky WGQGTQVTVSS SEQIDNO:15 

my WGQGTQVTVSS SEQ ID NO:16 

<jy WGQGTQVTVSS SEQ ID N0:17 

g q WGQGAQVTVS S SEQIDNO:18 

t r t f a y n y WGQGTQVTVS S SEQIDNO:19 : 

n n WGQGTQVTAS S SEQ ID NO:20 

-sdy n y WGQGI QVTASS SEQ ID NO:21 

ky WGQGTQVTVS S SEQ ID NO:22 18differentca 

i fgw--ddFGQGTOVTVSS SEQ ID NO:23 VHfliegion 

n v WGQGTQVTVS S SEQ ID NO:24 

t r WGQGTQVTVSS SEQ ID NO:25 

^gay ai WGQGTHVTVS S SEQ ID NO:26 

I t f a yny WGRGTQVTVS S SEQ ID NO:27 

IcedgynyWGQGTQVTVSS SEQ ID NO:28 

dv WGQGTQVTVS S SEQ ID NO:29 

r y GDPGTQVTVS S SEQE>NO:30 



Human* mouse -size range 0-19 ea over 600 entries 
Camel 8-24 ea 18 entries 
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10 20 
EVQLVESGGG LVQPGQSLRL SCAASG SEQIDNO:80 

GG SVQGGGSLRL SCAISG SEQIDNO:84 

GG SVQAGGSLRL SCASSS SEQIDNO:88 

70 80 90 



RFTIS RDNSKNTLYL QMNSLRAEDTAVY 
RFIIS QDSTLKTMYL LMNNLKFEDTGTY 
RFI1S QDSAKNTVYL QMNSLKPEDTAMY 
camel Vhh 
WGQGTQVT VSS-- 
WGQGTQVT VSS - 



YCAR SEQIDNO:82 
YCAA SEQ ID NO:86 
YCKI SEQIDNO:90 

GTNEVCKCPKCP 



KPEPECTCPKCP 



WVRQA PGKGLEWVS SEQ ID NO:81 
WE REG POKE REG I A SEQIDNO:85 
WYRQA PGKEREFVS SEQIDNO:89 
110 



CDR2 i 
CDR2 I 
_<;UR2j 



WGQGTLVT VSS SEQff>NO:83 
WGQGTQVT VSS SEQIDN0:91 
WGQGTQVT VSS SEQH)NO:87 
C fl 2 

APELPGG PSVFVFP SEQIDNO:91 



APELLGG PSVFIFP SEQIDNO:87 



ELKTPLGDTTHTCPRCP 
EPKCSDTPPPCPRCP 
EPKSCDTPPPCPRCP 



KVKVTV •■ 
KVDKW 



APELLGG PSVFLFP SEQ ID NO: 126 

APELLGG PSVFLFP SEQ ID NO: 127 

APPWJ- PSVFLFP SEQ ID NO: 128 

APEFLGG PSVFLFP SEQ ID NO. 129 
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( 1 ) GENERAL INFORMATION: 



5,759.808 




i ) LOCATION: 1..22 



) SEQUENCE DESCRIPTION: SEQ ID NO:l: 



) INFORMATION FOR SEQ ID NO:2: 



( D ) TOPOLOGY: lii 
) MOLECULE TYPE: peptide 
) FEATURE: 



) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 22 
( B )TYPE:m 




2 ) INFORMATION FOR SEQ ID NO:4: 

( i ) SEQUENCE CHARACTERISTICS: 

( B ) TYPE: amino acid 

( C ) STRANDEDNESS: single 
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( D ) OTHER INFORMATION: /label=FRAMEWORK 1 
( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:4: 
Gly Gly Ser Val Gin Pro Gly Gly Sm Leo Thr Leu Ser Cys Thr Val 



) INFORMATION FOR SEQ ID NO:5: 

( i ) SEQUENCE CHARACTERISTICS: 
( A > LENGTH: 22 amino acids 
( B ) TYPE: amino acid 



( i x ) FEATURE: 

( A ) NAME/KEY: Domain 
( B ) LOCATION: 1..22 

( D ) OTHER INFORMATION: /label=FRAMEWORK 1 
( x i ) SEQUENCE DESCRIPTION: SEQ ID NOJ: 
Gly Gly Ser Val Gin Ala Gly Gly Scr Leu Aig Leu Ser Cys Thr Oly 



►RMATTON FOR SEQ ID NO*: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 21 amino acids 
( B ) TYPE: amino acid 



( D )TOPOLOGY:li 
) MOLECULE TYPE: pepcidi 



. ) INFORMATION FOR SEQ ID NO:7: 



( B ) TYPE: amino acid 
( C ) STRANDEDNESS: ai 
( D ) TOPOLOGY: linear 



) MOLECULE T1 
) FEATURE: 



5,759,808 



) INFORMATION FOR SEQ ID NO:8: 



) SEQUENCE CI 

( A ) LENGTH: 11 amino i 

( C ) STRANDEDNESS: si 
( D ) TOPOLOGY: linear 



) INFORMATION FOR SEQ ID NO* 

:E CHARACTERISTICS: 
( B ) TYPE: amino acid 



DESCRIPTION : SEQ ID 



) INFORMATION FOR SEQ ID NO:10: 



( D ) OTHER INFORMATION: b 
) SEQUENCE DESCRIPTION: SEQ ID NO:10: 
p G 1 y Gin Gl> Ala Gin Vil Thi 



( 2 ) INFORMATION FOR SEQ ID NO: 1 1 : 

( i ) SEQUENCE CHARACTERS 
( A ) LENGTH: 11 am 



5,759,808 



SQIDNO:ll: 

T r p Gly Gin Gly Hi Gin Val 



) INFORMATION FOR SEQ ID NO:12: 

( i ) SEQUENCE CHARACTERISTICS: 



( D ) OTHER INFORMATION: /label=FRAMEWORK 4 
( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:t2: 
An Gly Oln Oly Thr Gin Vol Thr Val Set Le 

) INFORMATION FOR SEQ ID NO 13: 



) FEATURE: 

( B ) LOCATION: 1..14 



) SEQUENCE DESCRIPTION: SEQ ID NO:13: 

a Leu Oln Pro Gly Oly Tyr Cy. Oly 



( D ) TOPOLOGY: linear 
) MOLECULE TYPE: peptide 



I SEQUENCE DESCRIPTION: SEQ ID NO:14: 
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( i ) SEQUENCE CHARACTERISTICS: 



5,759,808 

-continued 



( D ) OTHER INFORMATION: 
) SEQUENCE DESCRIPTION: SEQ ID 



) INFORMATION FOR SEQ IE 



) SEQUENCE DESCRIPTION: SEQ ID N< 



( 2 ) INFORMATION FOR SEQ ID NOJO: 

( i ) SEQUENCE CHARACTERISTICS: 



) MOLECULE TYPE: pet 



( A ) NAME/KEY: Domain 

( B ) LOCATION: 1.16 

( D ) OTHER INFORMATION: /kbd=CDI 

) SEQUENCE DESCRIPTION: SEQ ID NO:20: 



( 2 ) INFORMATION FOR SEQ ID NO:2I: 

( i ) SEQUENCE CHARACTERISTICS: 



( i I ) FEATURE: 
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) INFORMATION FOR SEQ ID NO-.25: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 27 amino acids 
( B ) TYPE: amino acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: peptide 

( i x > FEATURE: 



) SEQUENCE DESCRIPTION: SEQ ID NO:25: 



T r p Gl y G 1 
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( i i ) MOLECULE TYPE: peptide 



( i I ) FEATURE: 

( A ) NAME/KEY: Domain 

( B ) LOCATION: 1..12 

( D ) OTHER INFORMATION: flabd=CH3 

( I i ) SEQUENCE DESCRIPTION: SEQ ID N035: 



( 2 ) INFORMATION FOR SF.Q [D NO:36: 



( A ) NAME/KEY: Domain 

( B ) LOCATION: 1..18 

{ D ) OTHER INFORMATION: /Iabel=CH3 

( I i ) SEQUENCE DESCRIPTION: SEQ ID SO-36: 

Oly 01. Thr Arg Glu Pro O 1 n Val Tyr Ilr Leu All Pro Xaa Arg Leu 




( A ) NAME/KEY: Region 

( B ) LOCATION: 1..12 

( D ) OTHER INFORMATION: 



( i i ) SEQUENCE DESCRIPTION: SEQ ID N037: 



( i ) SEQUENCE CHARACTERISTICS: 



( D ) TOPOLOGY: linear 

) MOLECULE TYPE: peptide 

) FEATURE: 

( A ) NAME/KEY: Region 

( B ) LOCATION: 1.35 

( D ) OTHER INFORMATION: 

) SEQUENCE DESCRIPTION: SEQ ID N< 



. ) INFORMATION FOR SEQ ID NO:39: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 28 amino acids 
( B ) TYPE: amino acid 



( D ) TOPOLOGY: linear 
( i i ) MOLECULE TYPE: peptide 



) SEQUENCE DESCRIPTION: SEQ ID NO'39: 



) INFORMATION FOR SEQ ID NO^O: 

( i ) SEQUENCE CHARACTERISTICS: 



X a a Pro Ly 



( B ) LOCATION: 1. 
( D ) OTHER INFOR 

) SEQUENCE DESCRIPTION: SEQ ID NCC40: 

a Pro Olu Leu Pro G 1 y 



) INFORMATION FOR SEQ ID NO:41 : 

( i ) SEQUENCE CHARACTERISTICS: 



57 



5,759.808 

-continued 



58 




( x ■ ) SEQUENCE DESCRIPTION: SEQ ID NO:44: 



59 



a 1 Ph e lie fbt 



) INFORMATION FOR SEQ ID NO:«: 



( B ) TYPE: amino acid 

( C ) STRANDEDNBSS: single 

( D ) TOPOLOGY: linear 

) MOLECULE TYPE: peptide 

) FEATURE: 



) FEATURE: 

( A ) NAME/KEY: Domain 

( B ) LOCATION: 1.14 

( D ) OTHER INFORMATION: /label=CH2 

) SEQUENCE DESCRIPTION: SEQ ID NO:45: 

. Pro G 1 u Leu Pro Oly Olj Pr 



( D ) TOPOLOGY: linear 
) MOLECULE TYPE: peptide 
) FEATURE: 



) SEQUENCE DESCRIPTION: SEQ ID NO:46: 

a Pro Oli Leu Leu Oly Ol 



) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 21Wpair> 
( B ) TYPE: nude* acid 
( C 1 STRANDEDNBSS: single 
( D ) TOPOLOGY: linear 



) MOLECULE TYPE: Ol 



( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:47: 
CGCCATCAAG GTAACAGTTG A 



( 2 ) INFORMATION FOR SEQ ID NO:48: 

{ i ) SEQUENCE CHARACTERISTICS: 
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, ) DESCRIPTION: DNA < s) 



( B ) LOCATION: 12..17 
( D ) OTHER INFORMATION: /Iabel=XhoI rii 
/ note= "RESTRICTION SITE" 



< i i ) SEQUENCE DESCRIPTION: SEQ ID 
GGTCCAOCT OCTCGAGTCT GO 

2 INFORMATION FOR SEQ ID NO:49: 



( A ) LENGTH: 22 





( A ) NAME/KEY: mi 
( B ) LOCATION: 12..17 
( D ) OTHER INFORMATION: /li 



) SEQUENCE CHARACTERIST 



) MOLECULE TYPE: Other 



( i i ) SEQUENCE DESCRIPTION: SEQ ID NO:51 : 
TTAACTAG TGAGOAGACG GTGACCTO 

) INFORMATION FOR SEQ ID NO:52: 

( i ) SEQUENCE CHARACTERISTICS: 
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( * i ) SEQUENCE DESCRIPTION: SEQ ID NOSJ: 



( 2 ) INFORMATION FOR SEQ ID NO J4: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 43 amino acids 
( B ) TYPE: amino acid 



( D ) TOPOLOGY: linear 
) MOLECULE TYPE: peptide 




(Hi) SEQUENCE DESCRIPTION: SEQ ID N054: 
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(in) FEATURE: 

( A ) NAME/KEY: Domain 
( B ) LOCATION: 1.11 
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< i i ) SEQUENCE 



) INFORMATION FOR SEQ ID NOrfl: 

( i ) SEQUENCE CHARACrERISTlCS: 



( A ) NAME/KEY: D 



) INFORMATION FOR SEQ ID NO*2: 

( i ) SEQUENCE CHARACTERISTICS: 

( B ) TYPE: amino acid 

( C ) STRANDEDNESS: single 



) MOLECULE TYPE: j*< 



) NAME/KEY: Region 
) LOCATION: 1..14 
. ) OTHER INFORMATION: /label=VH 



) FEATURE: 

( D ) OTHER INFORMATION: /labd=CDR3 
) SEQUENCE DESCRIPTION: SEQ ID NO:62; 
a Leu Oln Pro Gly Gly Ty 



( 2 ) INFORMATION FOR SEQ ID NO«: 

( i ) SEQUENCE CHARACTERISTICS: 




( A ) NAME/KEY: Region 

( B ) LOCATION: 1.12 

( D ) OTHER INFORMATION: a 



( A ) NAME/KEY: Domain 

( B ) LOCATION: 1..12 

( D ) OTHER INFORMATION: /Iabel=CDR3 



( I i ) SEQUENCE DESCRIPTION: SEQ ID NO:63: 
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( v k ) ORIGINAL SOURCE: 



5,759,808 
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( D ) OTHER INFORMATION: /1abcl=CDR3 
( X i ) SEQUENCE DESCRIPTION: SEQ ID NO:fjCj: 



( 2 ) INFORMATION FOR SEQ ID NO:67: 

( i ) SEQUENCE CHARACTERISTICS: 




( D ) OTHER INFORMATION: /label=VH 

) FEATURE: 

( A ) NAME/KEY: Domain 

( B ) LOCATION: 1.17 

( D ) OTHER INFORMATION: /label=CDR3 

) SEQUENCE DESCRIPTION: SEQ ID NO*7: 



) INFORMATION FOR SEQ ID NO.-68: 



) MOLECULE TYPE: protein 

) FEATURE: 

( A ) NAME/KEY: Region 

( B ) LOCATION: IM 

( D ) OTHER INFORMATION: Ik 



) INFORMATION FOR SEQ ID NO:«9: 

( i ) SEQUENCE CHARACTERISTICS: 
(A ) LENGTH: 16 amino acids 
( B ) TYPE: amino acid 
( C ) STRANDEDNESS : single 



< i i ) MOLECULE TYPE: protein 



5,759,808 
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( A | NAME/KEY: Domain 

( B ) LOCATION: 1..16 

( D > OTHER INFORMATION: /label=CDR3 



( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:®: 
Ola Lys Lys Asp Arg T h r Arg Trp Ala Olu Pro Aig Oil Trp A s n Asn 



( D ) TOPOLOGY: lin 
) MOLECULE TYPE: protein 



( A ) NAME/KEY: Region 

( B ) LOCATION: l_2] 

( D ) OTHER INFORMATION: /la 

) FEATURE: 

( A > NAME/KEY: Domain 

( B ) LOCATION: 1..21 

( D ) OTHER INFORMATION: /la 



) INFORMATION FOR SEQ ID NO:71 : 

( i ) SEQUENCE CHARACTERISTICS: 

< A ) LENGTH: 16 amino acids 

< B ) TYPE: amino acid 




( 2 ) INFORMATION FOR SEQ ID 
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( B ) LOCATION: 1. 



) SEQUENCE DESCRIPTION: SEQ ID NO:72: 

p Ser Pro Cys Tyr Met Pio Thi 



) INFORMATION FOR SEQ ID 



) MOLECULE TYPE: proton 

) FEATURE: 

( A ) NAME/KEY: Region 

( B ) LOCATION: 1.15 

( D ) OTHER INFORMATION: A 



( D ) OTHER INFORMATION: 



( 2 ) INFORMATION FOR SEQ ID NO:74: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 16 amino acids 



( D ) TOPOLOGY: linear 

) MOLECULE TYPE: protein 

) FEATURE: 

( A ) NAME/KEY: Region 



) SEQUENCE DESCRIPTION: SEQ ID NO:74: 

i Glu lie Gin Trp Tyi Oly Cys Asn Leu Arg Thi Thr Pee T b 



( 2 ) INFORMATION FOR SEQ ID NO:75: 

( i ) SEQUENCE CHARACTERISTICS: 



5,759.808 
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) FEATURE: 

< A ) NAME/KEY: Domain 

< B ) LOCATION: 1..22 

( D ) OTHER INFORMATION: /lab«l=CDR3 

) SEQUENCE DESCRIPTION: SEQ ID NO:75: 



) INFORMATION FOR SEQ ID NO:7«: 

( i ) SEQUENCE CHARACTERIS 



) FEATURE: 

( A ) NAME/KEY: Region 

< B ) LOCATION: 1.24 

( D ) OTHER INFORMATION: /U 



Olu Me I Oly Ala Cys 



( 2 ) INFORMATION FOR SEQ ID NO:T7: 



( B ) LOCATION: 1..24 



( D ) OTHER INFORMATION: /labcl=CDR3 



( i i ) SEQUENCE DESCRIPTION: SEQ ID NO:77: 
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( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:89: 
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( i i ) MOLECULE TYPE: cDNA 

( X i ) SEQUENCE DESCRIPTION: SEQ ID NO:93: 

TCTTCTAAAT ATATGCCTTG CACCTACGAC ATGACCTGGT ACCGCCAOGC TCCAGOCAAG 

GAGCGCGAAT TTGTC TCAAG TATAAATATT OATGGTAAGA CAACATACGC AGACTCCGTG 

AAOOGCCGAT TCACCATCTC CCAAGACAOC OCCAAGAACA CGGTGTATCT GCAGATGAAC 

AGCCTGAAAC CTGAGGACAC GGCOATGTAT TACTGTAAAA TAOATTCGTA CCCGTOCCAT 

CTCCTTGATG TCTGGGGCCA GGOGACCCAO GTCACCGTCT CCTCACTAGT TACCCOTACG 



AGCTTCCGGA CTACGGTTC 
( 2 ) INFORMATION FOR SEQ ID NO:94: 



( i i ) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:94: 

CAGOTGAAAC TGCTCOAGTC TOGAGOAGGC TCOGTGCAGA CTGGAGGATC TCTGAGACTC 

ICCIOTOCAO TCTCTGOATT CTCCTTTAGT ACCAGTTGTA TGGCCTOOTT CCGCCAGGCT 

TCAGOAAAGC AOCOTOAGOO GGTCGCAOCC ATTAATAOTO OCOOTGOTAG OACATACTAC 

AACACATATG TCGCCGAGTC CGIGAAGOOC COAITCOCCA TCTCCCAAGA CAACGCCAAG 

ACCACOOIAT ATCTTGATAT OAACAACCTA ACCCCIOAAO ACACOOCTAC GTATTACTOT 

GCOOCGOICC CAOCCCACTT GOGACCTGOC GCCATTCTTG ATTTGAAAAA GTATAAOTAC 

TGGGOCCAOO OGACCCAGGT CACCOTCTCC TCACTAGCTA OTTACCCGTA COACGTTCCG 

OACTACOOTT CTTAATAGAA TIC 

( 2 ) INFORMATION FOR SEQ ID NCWB: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 433 base pairs 
( B ) TYPE: nucleic acid 
( C ) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 



GTGAGOGGG TCACAGCOAT TAACACTOAT OOCAGTATCA TATACGCAGC CGACTCCGTO 

AGGGCCOAT TCACCATCTC CCAAGACACC GCCAAOGAAA CGGTACATCT CCAGATGAAC 

ACCTGCAAC CTGAGGATAC GGCCACCTAT TACTGCGCGG CAAGACTGAC GGAOATGOGG 

CTTGTGATG CGAGATGGOC GACCTTAGCG ACAAOGACGT TTGCOTATAA CTACTGGGGC 

GGOOGACCC AGGTCACCGT CTCCTCACTA OTTACCCGTA COACGTTCCG OACTACGGTT 



( 2 ) INFORMATION FOR SEQ ID NO:96: 



5.759.8i 



) SEQUENCE CHARACTERISTICS: 



) MOLECULE TYPE: cDNA 

) SEQUENCE DESCRIPTION: SEQ ID NO:95: 

AAC TOCTCOAOTC TGGGGGAGGG TCOGTOCAOG CTOOAOOOTC TCTGAGACTC 

ATG TCTCTGGCTC TCCCAGTAGT ACTTATTOCC TGGOCTGGTT CCOCCAGGCT 

AGO AOCGTGAGGO OGTCACAGCG ATTAACACTG ATOGCAOTGT CATATACGCA 

CCO TGAAGGGCCG ATTCACCATC TCCC AAGACA CCGCCAAGAA AACGGTATAT 

, T G A ACAACCTOCA ACCTGAGGAT ACGOCCACCT ATTACTGCGC GGCAAGACTG 

.TOO OOOCTTOTOA TGCGAGATOG GCGACCTTAO COACAAGGAC OTTTGCOTAT 

GOG GCCGGGOGAC CCAGGTCACC GTCTCCTCAC TAGCTAOTTA CCCGTACGAC 

tACT ACGGTTCTTA ATAGAATTC 



) INFORMATION FOR SEQ ID NG97: 

( i ) SEQUENCE CHARACTERISTICS: 



( D ) TOPOLOGY: linear 
i i ) MOLECULE TYPE: cDNA 
x i ) SEQUENCE DESCRIPTION: SEQ ID NOS7: 

OTCTO OAGGAGGCTC GGCOCAOGCT OGAOOATCTC TOAOACTCTC 



AGCCTGAAAC CTOAGOACAC GGCCATCTAC TTCTOTGCAO CAOOCTCOCO TTTTTCTAGT 
CCTOTTGOGA GCACTTCTAO ACTCGAAAGT AOCGACTATA ACTATTGOGG CCAGGGGATC 
CAGOTCACCG TCACCTCACT AOTTACCCGT ACGACGTTCC GOACTACOGT TCTTAATAOA 



) SEQUENCE CHARACTERISTICS: 



( i i ) MOLECULE TYPE: cDNA 



CTCGAGTCTG OAGGAGGCTC GGTTCAGGCT OGAGGGTCCC TTAGACTCTC CTGTOCAGCC 

TCTGACTACA CCATCACTGA TTATTGCATG OCCTOOTICC GCCAGOCTCC AOGGAAGGAG 

CGTGAATTGG TCGCAGCGAT TCAAGT TGTC CGTAGTGATA CTCOCCTCAC AGACTACGCC 

GACTCCGTGA AGGGACGATT CACCATCTCC CAAGGCAACA CCAAGAACAC AGTGAATCTG 

CAAATGAACA GCCTGACACC TGAGGACACG OCCATCTACA GTTGTOCOOC AACCAGTAGT 

TTTTACTGGT ACTGCACCAC GGCGCCTTAT AACGTCTGGO GTCAGGGGAC CCAGGTCACC 
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) INFORMATION FOR SBQ ID NO:102: 



( D ) TOPOLOGY: lines 
i i ) MOLECULE TYPE: cDNA 



CTCGAGTCTG GGOOAOGCTC OGTACAAACT GGAGGOTCTC TGAGACTCTC TTOCGAAATC 

TCTOGATTOA CTTTTGATGA TTCTGACGTG OOOTOOTACC GCCAOGCTCC AGGGGATGAG 

TGCAAATTOG TCTCAOGTAT TCTGAOTGAT GOTACTCCAT AT ACAAAGAG TOOAOACTAT 

GCTGAGTCTG TGAGGGGCCG OGTTACCATC TCCAGAGACA ACGCCAAOAA CATGATATAC 

CTTCAAATOA ACOACCTGAA ACCTGAOQAC ACGGCCATGT ATTACTGCGC GOTAGATGGT 

TGGACCCGGA AGGAAOGGGG AATCGGGTTA CCCTOOTCGG TCCAATGTGA AOATOOTTAT 

AACTATTGOG GCCAGOGGAC CCAGGTCACC GTCTCCTCAC TAOTTACCCG TACGACGTTC 

COOACTACOG TTCTTAATAG AATTC 

( 2 ) INFORMATION FOR SEQ ID NO:103: 



) STRANDEDNESS: single 



CTCGAGTCTG GAOGAGOCTC GGTGCAGGCT GGAGGGTCTC TGAGACTCTC CTGTOTAGCC 

TCTOGATTCA ATTTCGAAAC TTCTCGTATG OCGTGGTACC GCCAOACTCC AOGAAATGTG 

TOTOAOTTGG TCTCAAGTAT TTACAGTGAT OGCAAAACAT ACTACGTCOA CCGCATOAAG 

OOCCGATTCA CCATTTCTAG AOAGAATOCC AAOAATACAT TGTATCTACA ACTGAGCGOC 

CTCAAACCTG AGGACACGGC CATGTATTAC TGTOCGCCOG TTGAATATCC TATTOCAGAC 

ATGTOTTCGA OATACGGCGA CCCGOGGACC CAGGTCACCO TCTCCTCACT AOTTACCCOT 

ACOACGAACC GGACTACGGT TCTTAATAGA ATTC 

( 2 ) INFORMATION FOR SEQ ID NO:104: 



( D ) TOPOLOGY: lin 



( l i ) SEQUENCE DESCRIPTION: SEQ ID NO:104: 

CTCGAGTCTG GGGGAGGCTC GGTGCAGGTT GGAGGGTCTC TGAAACTCTC CTGTAAAATC 

TCTOGAGGTA CCCCAGATCG TGTTCCTAAA TCTTTGOCCT OOTTCCGCCA GOCTCCAGAG 

AAGGAGCGCG AGGGGATCGC AOTTCTTTCG ACTAAQOATG GTAAGACATT CTATGCCGAC 
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GCCTCGAG AATOGAACAA CTGGOGCCAG OGGACCCAGG TCACCGTCTC CTCA 
) INFORMATION FOR SEQ ID NO: 108: 



( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 381 base pairs 
( B )TYPE: nucleic acid 



( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: cDNA 

( X i ) SEQUENCE DESCRIPTION: SEQ ID NO:108: 

CTCGAGTCAO GTGTCCGGTC TOATGTGCAG CTGGTGOCGT CTOOOOOAOO CTCOOIOCAO 

OCTGGAGGCT CTCTOAGACT CTCCTGTACA OCCTCTOGAO ACAGTTTCAO TAGATTTGCC 

ATGTCTTGGT TCCGCCAGGC TCCAGOGAAG GAGTGCGAAT TOOTCTCAAG CATTCAAAGT 

AATOGAAOGA CAAC TGAGGC COATTCCGTG CAAGOCCGAT TCACCATCTC CCGAOACAAT 

TCCAGOAACA CAGTGTATCT GCAAATOAAC AGCCTOAAAC CCGAGGACAC GOCCGTGTAT 

TACTGTGOGG CAGTCTCCCT AATGGACCOA ATTTCCCAAC ATGGGTGCCG GGGCCAOGGA 

ACCCAOGTCA CCGTCTCCTT A 

( 2 ) INFORMATION FOR SEQ ID NO:109: 

( i ) SEQUENCE CHARACTERISTICS: 



( i i ) SEQUENCE DESCRIPTION: SEQ ID NO:109: 
Gly Ola Pro Arg OH Pro Gin Val Tyr T h r Leu Pro Pro Ser Arg Asp 



) SEQUENCE CHARACTERISTICS: 



( i i ) SEQUENCE DESCRIPTION: SEQ ID NO.I10: 
Gly Gin Pro Arg Oil Pro Gin Val Tyr Ibi Leu Pro Pro Ser Arg Glu 
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) SEQUENCE DESCRIPTION: SEQ ID NO:Ill: 



) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 24 amino acids 
( B ) TYPE: amino acid 
( D ) TOPOLOGY: linear 



) SEQUENCE DESCRIPTION: SEQ ID N( 



) INFORMATION FOR SEQ ID NO:113: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 23 amino acids 
( B ) TYPE: amino acid 



) SEQUENCE DESCRIPTION: SEQ ID NO:113: 



) SEQUENCE CHARACTERISTICS: 

( B j TYPE: amino acid 
( D ) TOPOLOGY: linear 

) MOLECULE TYPE: peptide 

) SEQUENCE DESCRIPTION: SEQ ID NO:114: 



) INFORMATION FOR SEQ ID NO:115: 

( i ) SEQUENCE CHARACTERISTICS: 



( I i ) SEQUENCE DESCRIPTION: SEQ ID NO: 115: 
L y s Gly Gin Pro Arg Glu 



( 2 )INFORMATIONFORSEQIDNO:116: 
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) SEQUENCE CHARACTERISTICS: 



i ) MOLECULE TYPE: peptide 

i ) SEQUENCE DESCRIPTION: SEQ ID NO:116: 

ys Gly Oil Pro Aig Glu Pro O 1 it Val Tyr Thr Leu Pro Pro Se 



( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 19 amino acids 

( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: peptide 

( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:117: 

Lys Gly Oil Pro Arg Glu Pro Gin Val Tyr Thr Leu Pro Pro Ser Gin 



) SEQUENCE CHARACTERISTICS: 



) MOLECULE TYPE: peptide 

) SEQUENCE DESCRIPTION: SEQ ID 



( 2 ) INFORMATION FOR SEQ ID NO:l 19: 

( i ) SEQUENCE CHARACTERISTICS: 



) MOLECULE TYPE: peporV 



) INFORMATION FOR SEQ ID NO:120: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 11 amino acids 
( B )TlT«: amino acid 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: peptide 



( i i ) SEQUENCE DESCRIPTION: SEQ ID NO:120: 
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( i i ) MOLECULE TYPE: peptide 
( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:121: 
Tip Gly Oil Gly Thr Met V a 1 Tht Val Set Se 

) INFORMATION FOR SEQ ID NO:122: 

( i ) SEQUENCE CHARACTERISTICS: 



) INFORMATION FOR SEQ ID NO:I21: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 11 amino acids 
( B ) TYPE: amino acid 



) SEQUENCE DESCRIPTION: SEQ ID NO:122: 

p Gly CI. Gly Thr Thi Le 



) INFORMATION FOR SEQ ID NK 



FOR SEQ ID NO: 124: 
SEQUENCE CHARACTERISTICS: 



) SEQUENCE DESCRIPTION: SEQ ID NO:124: 



i ) LENGTH: 21 an 
. ) TYPE: amino ac 
i ) TOPOLOGY: lift 



( I i ) SEQUENCE DESCRIPTION: SEQ ID NO:12S: 
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(li) SEQUENCE DESCRIPTION: SEQ ID NO:129: 
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( 2 ) INFORMATION FOR SEQ ID NO:130: 

( i ) SEQUENCE CHARACTERISTICS: 
( A ) LENGTH: 11 amino acids 
( B ) TYPE: annuo acid 
( D ) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: peptide 



We claim: 

1. Nucleotide sequence encoding an immunoglobulin or a 
fragment of an immunoglobulin selected from the group 
consisting of the V HH , hinge, FV H „h. F(V HH h) 2 . CDR, 
CDRj, CDR 3 . FW. C„2, and C„3, said immunoglobulin 
comprising two heavy polypeptide chains, each heavy chain 
consisting of a complete antigen binding site, said immu- 
noglobulin containing a variable (V„„) region and a con- 
stant region, said constant region being devoid of first 
constant domain C H l. wherein the immunoglobulin is 
devoid of polypeptide light chains, which immunoglobulin 
comprises a peptide sequence selected from the group 
consisting of: 

VTVSSGTNEVCKCPKCPAPELPGOPSWFWFP (SEQ ID NO:43), 
VTVSSEPKPQPQPKPQPQPQPQPKPQPQPEPE 

CTCPKCPAPELLGGPSWIFP (SEQ ID NO:44) 
GTNEVCKCPKCP (SEQ ED NO:37) 
APELPGGPSVFVFP (SEQ ID NO:45) 

EPK1PQPQPKPQPQPQPQPPKPQPKPEPEECTCPKCP (SEQ ID NO:38) 
APELLGGPSVFIFP (SEQ ID NO:46) 
APELLGGPTVFIFPPKPKDVLSrrLTP (SEQ ID NO:31) 
APEIJTOPSWVFPTKPKDVLSISGRP (SEQ ID NO:32) 
APELPGGPSVFVFPPKPKDVLSISGRP (SEQ ID NO:33) 
APFT I QGPS VTTFPPKPKDVLSISGRP (SEQ ID NO:34) 
GQTREPQVYTLA (SEQ ID NO:35) 
GQTREPQVYTLAPXRLEL (SEQ ID NO:36) 
GQPREPQVYTLPPSRDEL (SEQ ID NO: 109) 
GQPREPQVYTLPPSREEM (SEQ ED NO:110) 
GQPREPQVYTLPPSQEEM (SEQ ID NO:lll) 
GGSVQTGGSLRLSCEISGLTFD (SEQ ID NO:l) 
GGSVQTGGSLRLSCAVSGFSFS (SEQ ID NO:2) 
GGSEQGGGSLRLSCAISGYTYG (SEQ ID NO:3) 
GGSVQPGGSLTLSCTVSGATYS (SEQ ID NO:4) 
GGSVQAGGSLRLSCTGSGFPYS (SEQ ID NO:5) 
GGSVQAGGSLRLSCVAGFGTS (SEQ ID NO:6) 
GGSVQAGGSLRLSCVSFSPSS (SEQ ID NO:7) 
WGQGTQVTVSS (SEQ ID NO:8) 
WGQGTLVTVSS (SEQ ID NO:9) 
WGQGAQVTVSS (SEQ ID NO:10) 
WGQGTQVTASS (SEQ ID NO:ll) 
RGQGTQVTVSL (SEQ ID NO:12) 

ALQPGGYCGYGX C L (SEQ ID NO:62) 

VSLMDRISQH GC (SEQ ED NO:63) 

VPAHLGPGAILDLKKY K Y (SEQ ID NO:64) 

FCYSTAGDGGSGE MY (SEQ ID NO:65) 

ELSGGSCELPLLF DY (SEQ ID NO:66) 

DWKYWTCGAQTGGYF GQ (SEQ ID NO:67) 

RLTEMGACDARWATLATRTFAYNY (SEQ ID NO:68) 

QKKDRTRWAEPREW NN (SEQ ID NO:69) 

GSRFSSPVGSTSRLES-SDY NY (SEQ ID NO:70) 

ADPSIYYSILXIEY KY (SEQ ED NO:71) 

D S P C Y M P T M P A P P I R D S F G W - - D D ( SEQ ID NO:7 2) 



TSSFYWYCTTAPY N V (SEQ ID NO:73) 

TEIEWYGCNLRTTF TR (SEQ ID NO:74) 

NQLAGG WYLDPNYWLS VGAY-- A I (SEQ ID NO:75) 
RLTEMGACDARWATLATRTFAYNY (SEQ ED NO:76) 
DGWTRKEGGIGLPWSVQCEDGYNY (SEQ ID 
25 NO:77) 

DSYPCHLL D V (SEQ ID NO:78); 

and 

VEYPIADMC S R Y (SEQ ID NO:79). 

2. Process for the preparation of antibodies directed 
30 against determined antigens, comprising the steps of: 

cloning into vectors, especially into phages and more 
particularly filamentous bacteriophages, a DNA or 
cDNA sequence obtained from lymphocytes of Cam- 

35 elids previously immunized with determined antigens, 
capable of producing an immunoglobulin comprising 
two heavy polypeptide chains, each heavy chain con- 
sisting of a complete antigen binding site, said immu- 
noglobulin containing a variable <y HH ) region and a 
constant region, said constant region being devoid of 
first constant domain C w l. wherein the immunoglobu- 
lin is devoid of polypeptide light chains; 
transforming prokaryotic cells with said vectors in con- 
ditions allowing the production of the antibodies; 

45 selecting the appropriate antibody by subjecting the trans- 
forming cells to antigen-affinity selection; and recov- 
ering the antibodies having the desired specificity. 

3. Recombinant vector comprising a nucleotide sequence 
according to claim 1. wherein the vector is a plasmid, a 

50 phage especially a bacteriophage, a virus, a YAC. or a 
cosmid. 

4. Recombinant cell or organism modified by a vector as 
claimed in claim 3. 

5. An isolated nucleic acid encoding an immunoglobulin 
55 comprising two heavy polypeptide chains, each heavy chain 

consisting of a complete antigen binding site, said immu- 
noglobulin containing a variable (V HW ) region and a con- 
stant region, said constant region being devoid of first 
constant domain C w l. wherein the immunoglobulin is 
Go devoid of polypeptide light chains, wherein said nucleic acid 
comprises a nucleic acid fragment having a sequence 
selected from SEQ ID NOS: 92-108. 

6. A process for the preparation of antibodies directed 
against determined antigens, comprising the steps of: 

65 cloning into a vector, a DNA or cDNA sequence obtained 
from lymphocytes of Camelids previously immunized 
with determined antigens. 
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transforming prokaryotic cells with said vector in c 
tions allowing the production of the antibodies. 



112 

8. The process of claim 7 wherein said eukaryotic cell is 
selected from die group consisting of a yeast cell, mamma- 



selecting antibody by subjecting the transforming cells to lian cell, plant cell and protozoan cell. 

antigen-affinity selection, and 
recovering the antibodies having the desired specificity. 5 
7. The process according to claim 6. wherein the cloning 
vector is a plasmid or a eukaryotic virus and the transformed 

cell is a eukaryotic cell. ***** 



